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ABSTRACT 



A single quantum state can be shared by many distant parties. In this the- 
sis, we try to characterize the information contents of such distributed states 
by defining the multiparty information and the multiparty squashed entangle- 
ment, two steps toward a general theory of multiparty quantum information. 

As a further step in that direction, we partially solve the multiparty dis- 
tributed compression problem where multiple parties use quantum commu- 
nication to faithfully transfer their shares of a state to a common receiver. 
We build a protocol for multiparty distributed compression based on the fully 
quantum Slepian-Wolf protocol and prove both inner and outer bounds on the 
achievable rate region. We relate our findings to previous results in informa- 
tion theory and discuss some possible applications. 
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ABREGE 



Un etat quantique peut etre partage entre plusieurs entites qui sont spa- 
tialement separes. Dans ce memoire, nous essayons de caracteriser 1'information 
quantique contenue dans de tels etats distribues en definissant et utilisant 
les notions d'information multipartie (multiparty information) et d'intrication 
"ecrasee" multipartie (multiparty squashed entanglement). II s'agit de pre- 
miers pas vers une theorie generate de 1'information quantique multipartie. 

Nous faisons aussi un autre pas dans cette direction en etudiant le probleme 
de la compression distribute d'information quantique. En particulier, nous 
proposons un protocole de compression distribute base sur la version quan- 
tique du protocole de Slepian et Wolf et analysons ses caracteristiques. Nous 
discutons aussi la relation entre nos resultats et les travaux precedents dans la 
theorie de 1'information et soulignons quelques applications possibles de notre 
protocole. 
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CHAPTER 1 
Introduction 

Information theory is one of the most important mathematical theories 
developed in the last century. It finds applications in communications engi- 
neering, computer science, physics, economics, neuroscience and many other 
fields of modern science. Of particular interest are the recent developments in 
quantum information theory (QIT), a discipline which studies the limits that 
the laws of quantum mechanics impose on our ability to store, manipulate and 
transmit information. All information is physical; whether it be the magnetic 
domains of a hard disk platter, the reflective bumps on the surface of a DVD 
or the charge of the capacitors in a stick of RAM, that which we intuitively 
refer to as information must be stored in some physical system [1]. Thus, 
the incursion of quantum physics into information theory is inevitable if we 
want to understand the information properties of quantum systems like single 
photons and superconducting loops. 

Modern quantum information theory has elaborated a paradigm in which 
a set of spatially localized parties try to accomplish a communication task by 
using communication resources like channels, states and quantum entangle- 
ment [2, 3, 4, 5]. Such an approach is now possible because of the substantial 
body of results characterizing quantum communication channels [6, 7, 8, 9] 
and the truly quantum resource of shared entanglement [10, 11, 12]. In this 
new quantum paradigm of information theory, many classical results need to 
be revisited in the light of the peculiar properties of quantum information. 
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In classical information theory, distributed compression is the search for 
the optimal rates at which two parties Alice and Bob can compress and trans- 
mit information faithfully to a third party Charlie. If the senders are allowed to 
communicate among themselves then they can obviously use the correlations 
between their sources to achieve better rates. The more interesting problem 
is to ask what rates can be achieved if no communication is allowed between 
the senders. The classical version of this problem was solved by Slepian and 
Wolf [13]. The quantum version of this problem was first approached in [14, 15] 
and more recently in [5], which describes the fully quantum Slepian- Wolf 
(FQSW) protocol and partially solves the distributed compression problem 
for two senders. 

In this thesis, we analyze the multiparty scenario of distributed compres- 
sion where many senders, Alice 1 through Alice m, send quantum information 
to a single receiver, Charlie. We will describe the multiparty FQSW protocol 
and exhibit a set of achievable rates for this protocol. We also derive an outer 
bound on the possible rates for all distributed compression protocols based on 
the multiparty squashed entanglement. 

The multiparty squashed entanglement (independently discovered by Yang, 
et al. [16]) is a generalization of the squashed entanglement defined by Chris- 
tandl and Winter [17] and has very desirable properties as a measure of mul- 
tiparty entanglement. While there exist several measures for bipartite entan- 
glement with useful properties and applications [2, 18, 19, 20], the theory of 
multiparty entanglement, despite considerable effort [21, 22, 23, 24], remains 
comparatively undeveloped. Multiparty entanglement is fundamentally more 
complicated because it cannot be described by a single number even for pure 
states. We can, however, define useful entanglement measures for particular 
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applications, and the multiparty squashed entanglement is one such measure 
well-suited to application in the distributed compression problem. 

The main results of this thesis are contained in Chapters 4 and 5. Chap- 
ter 4 presents our original results on the multiparty generalization of squashed 
entanglement. Chapter 5 deals with the multiparty distributed compression 
problem and proves inner and outer bounds on the rate region. Before we get 
there, however, we will introduce some background material on classical and 
quantum information theory in Chapter 2. In Chapter 3, we describe some 
important recent results of quantum information theory which form the basic 
building blocks for our results. Finally, in Chapter 6 we take a look at some 
possible applications of the distributed compression results to the black hole 
information paradox. The dependency graph for the sections in this thesis is 
shown in Figure 1-1 on the next page. 

Most of the original results in Chapters 4 and 5 appear in a paper [25] 
co-authored with Prof. David Avis and Prof. Patrick Hayden to which the 
author has made substantial contributions. 
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CHAPTER 2 
Background 

In this chapter, we will present background concepts from classical in- 
formation theory and their analogues in quantum information theory. These 
concepts form the basic building blocks with which we will construct all sub- 
sequent results. Our coverage of the information theoretic topics is far from 
exhaustive; it serves to introduce a minimum prerequisite structure that can 
support the rest of the exposition. For an in-depth view of classical and quan- 
tum information theory the reader is referred to the classics in the fields: [26] 
and [27] respectively. 

2.1 Classical information theory 

In 1948 Claude Shannon published a seminal paper [28] titled "A math- 
ematical theory of communication" which set the stage for what has become 
one of the most fruitful modern mathematical theories. The field of informa- 
tion theory was born out of the need of communication engineers to quantify 
the information carrying capacities of channels and the theoretical aspects of 
data compression. 

2.1.1 Foundations 

At the root of Shannon's information theory is the simplifying assumption 
that information ultimately boils down to the statistics of the symbols used to 
express it. Thus, another name for information theory could be information 
statistics. By focusing solely on the statistics of the symbols, which can be 
described by mathematical equations and axioms in the spirit of Hilbert's 
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program [29], we can dispense with the difficult semantical questions related 
to humans. 

We say that information is produced by a source, which is a random vari- 
able X that takes on values from an alphabet X = {a 1 , a 2 , . . . , a'*'} according 
to some probability distribution Pr{X = x} = p(x). 

Example 2.1. Let X be the outcome of a coin flip. We will denote the alphabet 
X = { 'H\ e T'}. If the coin is fair, then 

Pr{X = a 1 } = Pr{X = 'H'} = 0.5, 
Pr{X = a 2 } = Pr{X = 'T'} = 0.5. 

In this case, all outcomes are equally likely and it is maximally difficult to 
guess the result of the coin flip. 

Example 2.2. Suppose the Canadian border control center receives an hourly 
status message M from a distant outpost. The possible messages are: 

• No one has attacked, which occurs 99.7% of the time: Pr{M = o; } = 0.997 

• The Americans have attacked: Pr{M = a 1 } = 0.002 

• The Russians have attacked: Pr{M = a 2 } = 0.001 1 

In this scenario, one of the outcomes, a , is much more likely than all the 
others. If we were to shut down the remote outpost and instead guess M = a 
every hour, we would only be wrong 0.3% of the time! Of course, implement- 
ing such an approximate border defense system is a silly idea, but in other 
situations an approximate result is just as good as the exact one. 



1 The quoted probabilities may not reflect the current geo-political balance 
of power. 
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Do we learn more information from one outpost message M, or from one 
coin flip XI Using information theory, we should be able to quantify the 
amount of information produced by each source. 

2.1.2 Shannon entropy 

According to Shannon, there exists a single function sufficient to quantify 
the information content of a source. This function is the key building block in 
all of information theory. 

Definition 2.3 (Shannon entropy). Given a statistical source X, over the 
alphabet X with probability function p(x), the quantity 

H(X) = -Y,P(x)log 2 p(x) (2-1) 

is the Shannon entropy of the source. 

The entropy of an unknown source measures our uncertainty about it and 
therefore, it measures how much information we learn, on average, when we 
look at a symbol from that source. The entropy is typically measured in bits 
since we use the base-2 logarithm in the calculation. 

The quantity — ^2 xeX p(x) \ogp(x) also appears in thermodynamics where 
it is known as the Boltzmann-Gibbs entropy function. It is used to denote the 
logarithm of the number of available microstates that are consistent with cer- 
tain macroscopic constraints [30, 31]. Together the entropy, energy, volume, 
pressure and temperature form the macroscopic description of a given ther- 
modynamical system. 

We now revisit the coin flip and outpost message scenarios from the pre- 
vious examples. 

Example 2.4. The entropy of the balanced coin flip is: 

H(X) = -0.5 log 0.5 -0.5 log 0.5 = - log 0.5 = log 2 = 1 [bit]. 
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In other words, we learn one bit of information every time we flip the coin. 
On the other hand, the entropy of an outpost message is only 



H(M) = -0.997 log 0.997 - 0.002 log 0.002 - 0.001 log 0.001 = 0.03222 [bits] . 



Therefore, every coin flip carries about 30 times more information than a 
message from the distant outpost. 

The true power of the information theoretic approach becomes apparent 
when we try to describe very long strings of symbols produced independently 
by the same source. Consider a source X which is used n times to produce 
the sequence X 1: X 2) . . . , X n . We will denote the entire sequence as X n with 
a superscript. We assume that the random variables X i are independent and 
identically distributed (i.i.d.) according top(x). 

We can write down the probability of a given string x n = xi, x 2 , ■ ■ ■ , x n 
occurring as 



since the JQ's are independent. 

Next we ask the important question: 

"How often does the symbol a 1 occur, on average, in a sequence 
of n uses of the source (X 1: . . . , X n )T 
Because every one of the symbols in the sequence has Pr{X = a' 1 } = p(a l ), 
the overall number of a*'s in a string of length n is going to be approximatively 



Pi{X n = x n } 



p{x 1 ,x 2 , ...,x n ) 




(2.2) 



np{a l ). Therefore, on average, the probability of a string x±, x 2 , . . . , x n is 

p(x u ...,x n ) = p(x 1 )p(x 2 ) ■ ■ -p(x n ) (2.3) 
« p(a 1 )---p(a 1 )p(a 2 )---p(a 2 )---p(a lxl )---p(a lxl ) (2.4) 

N v ' V v ' N v ' 

np(a 1 ) times np(a 2 ) np(a\ x \) 



— |^2 log2P( ! 



np(x) 



np(x) 



= 2- nH(x) . (2.5) 

By going from equation (2.3) to (2.4), we have made a crucial change in 
our point of view: instead of taking into account the individual symbols X; L of 
the sequence, we focus on the global count of the symbol's occurrences. In 
other words, we abandon the microscopic description of the string and trade it 
for a macroscopic one in the spirit of thermodynamics. At first, it is difficult 
to believe that the typical sequences all have the same constant probability 
of occurrence, but we will see in the next section that this intuitive argument 
can be made rigorous. 

2.1.3 Typical sets 

Much of information theory is based on the concept of typical sequences. 
In the i.i.d. regime, nearly all of the sequences produced by the source have the 
same probability of occurrence. Consider the following theorem which makes 
precise our earlier argument. 

Theorem 2.5 (Asymptotic equipartition theorem). Let Xi, X 2 , . . . , X n be a 

sequence of independent random variables distributed according to p{x), then 

lim Pr {\-^\ogp(X 1 ,X 2 , . . . ,X n ) - H(X)\ > e} = 0, Ve > 0. (2.6) 
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In other words, for large enough n, the probability that a sequence occurs 
approaches 2~ nH( - x ^ — a constant value. The result can also be interpreted in 
a different manner: sequences that have probability different from 2~ nH<yX ' > are 
not likely to occur. Using this insight, we can partition the space of all possible 
sequences, X n , into two sets. The set of sequences that have probability of 
occurrence close to 2~ nH( - XS) and those that do not. We will call the former the 
set of typical sequences. 

Definition 2.6 (Typical set). The set of entropy typical sequences with re- 
spect to p(x) is the set of all sequences x±, x 2 , ■ ■ ■ , x n satisfying: 

2 -n(H(X)+e) < ..., Xn )< 2~ n W X ^ (2.7) 

we will denote this set Te n \ 

The typical set, Te n \ has the following properties: 

(i) Pr{X n G T e (n) } > 1 - S Ve, 5 and n sufficiently large. 

(ii) |T e (n) | < 2 n ^ x ^ Ve and n sufficiently large. 

Property (i) is a consequence of the asymptotic equipartition theorem and 
says that for large n, most of the sequences that come out of the source will 
be typical. Property (ii) is a bound on the size of the typical set which follows 
from the fact that all typical sequences occur with the same probability. 

The bound on the size of the typical set is at the root of our ability to 
compress information. 

2.1.4 Compression 

Compression, also referred to as source coding, is our ability to encode 
a given source string into a shorter string while preserving most of the infor- 
mation contained therein. More generally, we talk about a compression rate 
which can be achieved for a given source X. 
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Definition 2.7 (Compression rate). We say a compression rate R is achiev- 
able if for all e > 0, there exists N(e) such that for n > N(e), there exist 
maps: 

E n :X n — > M \M\ = 2 nR (2.8) 
D n :M -> X n (2.9) 

such that 

Pi{X n ^Y n }<e (2.10) 

where Y n = (D n oE n )X n . 

Shannon's compression theorem [28] provides a bound the compression 
rates that are achievable for a given source X. 

Theorem 2.8 (Shannon source coding). Let X n = X 1 , X 2 , . . . X n be a se- 
quence of symbols i.i.d. ~ p(x), then any compression rate R which satisfies 

R>H(X), (2.11) 

is achievable for n sufficiently large. 

The idea behind Shannon compression is very simple. We begin by in- 
dexing the set of typical sequences in some order. We know that the size 
of T e (n) is 

therefore labels of length \H(X) + e] bits will be sufficient to index the typical 
sequences. The encoding operation E n for a given string x n consists of: 

• Recording the index of x n if x n G and 

• Rejecting the string and recording "error" if x n i T ( e n) . 

The decoding operation D n simply takes the index record and replaces it with 
the original string. 
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Because of Property (i) of the set of typical sequences, we know that the 
"error" condition will occur rarely: 

Pr{x n <£ T e (n) } < 5 Ve, 5 and n sufficiently large. (2.13) 

This guarantees the low-error condition Pr{X™ ^ Y n } < 5 for any 5 > 0. 
Shannon's coding theorem holds since the rate R = \H(X) + e] is achievable 
for any e provided n is large enough. 

2.1.5 Multiple sources 

When we consider situations involving more than one source, some new 
information theoretic quantities become relevant. Consider now two sources 
X and Y distributed jointly according to p(x, y). We will denote the marginals 
P( x ) = T, y P( x >y) andp(y) = 52 x p(x,y). 

Fist we define the quantity 

H(X\Y) = -J2p(^y)\og^f (2.14) 

x,y P[ > y > 

= J2p(y) H ( x \ Y = y) ( 2 - 15 ) 

y 

= H(XY) — H(Y) (2.16) 

which is known as conditional entropy. The conditional entropy measures the 
uncertainty in X that remains if we know the value of Y. 

The quantity that quantifies how much information is shared between two 
sources is 

I(X : Y) = H(X) + H(Y) - H(X, Y) (2.17) 

and is usually referred to as the mutual information. Two sources which have 
zero mutual information are independent. 

The mutual information plays a key role in the characterization the infor- 
mation carrying capacity of memoryless channels. Together the compression 
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and channel capacity formulas are the two pillars of Shannon's information 
theory. In this thesis we focus mainly on compression problems and refer the 
reader interested in channel capacities to the classic texts [26, 32]. 

In order to get a better intuitive understanding of the conditional entropy 
and the mutual information, we often use a Venn-like diagram to represents 
them as in Figure 2-1. 




Figure 2-1: Graphical representation of the conditional entropy and the mu- 
tual information. 

Furthermore, one can define the conditional mutual information by con- 
ditioning the mutual information formula on a third system Z. 

I(X:Y\Z) = H(X\Z) + H(Y\Z)-H(XY\Z) (2.18) 
= H(XZ) + H(YZ) -H(XYZ) - H(Z). (2.19) 

The conditional mutual information measures the correlations between X and 
Y that are not shared with the variable Z. 

2.1.6 Slepan- Wolf coding 

Next we turn to the compression of two correlated sources X and Y 
distributed according to p(x,y). If the two sources can be encoded together, 
then according to the Shannon's theorem a compression rate of H(X, Y) can 
be achieved. The more interesting problem requires the sources to be encoded 
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separately without communication between the encoders. This is known as 
the Slepian-Wolf source coding problem [13]. 

Using the coding scheme suggested by Slepian and Wolf, we can compress 
at rates (Rx, Ry) for X and Y respectively if they satisfy the inequalities 

R x > H(X\Y), 

R Y > H(Y\X), (2.20) 

R x + Ry> H{XY). 

This set of inequalities corresponds to an achievable rate region in the (Rx, Ry)- 
plane, as illustrated in Figure 2-2. 

Ry t 



H(Y) - 
H(Y\X) - 




H(X\Y) H(X) 



Rx 



Figure 2-2: The classical Slepian-Wolf rate region. The points a and f3 are 
two corner points of the region. 

To prove that the Slepian-Wolf rate region is achievable, we only need 
to show protocols which achieve the rates of the two corner points a and f3. 
Any rate pair on the line between the two corner points can be achieved by 
time sharing. All other points in the rate region can be obtained by resource 
wasting. 

The proof that the corner points are achievable relies on a coding scheme 
based on random bins and the properties of jointly typical sequences. A string 
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(x n ,y n ) is jointly typical if x n is typical according to p(x), y n is typical ac- 
cording to p(y) and (x n , y n ) is typical according to p(x, y). To encode, we will 
randomly assign to each string x n an index i(x n ) G {1, 2, . . . , 2 nRx }. Simi- 
larly, to each y n we assign an index j(x n ) G {1, 2, ... , 2 nRr }. The decoding 
operation takes the received indices and tries to reproduce a copy of the 
original string (x n ,y n ). 

In the case of point a from Figure 2-2, the rates correspond to 



in the limit where e\ and e 2 go to zero. According to Shannon's source coding 
theorem (Theorem 2.8), the rate of equation (2.21) is sufficient to faithfully 
decode the string x n , i.e. with high probability x n = x n . The decoder then has 
to find the string y n which is jointly typical with the decoded x n and this is 
possible provided the rate Ry is greater than the conditional entropy H(Y\X). 
The coding scheme for point (3 is analogous. 

The multiparty version of the Slepian-Wolf problem was considered in 
[33, 34]. In the multiparty case, we have not two but m sources X ± ,X 2 , . . . , X m 
which are to be encoded separately and decoded by a common receiver. We 
want to know the optimal rate tuple (Ri, R2, ■ ■ ■ , R m ) at which we can com- 
press the corresponding sources such that the information can be recovered 
faithfully after decoding. It is shown in [34] that the rates have to satisfy the 
following set of inequalities 



R x = H(X) + e 1 , 



(2.21) 



R Y = H(Y\X) + e 2 . 



(2.22) 




(2.23) 
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for all K C {l,2,...,ra}, K>= {1, 2, . . . , m} \/C and := {Xj : i E £}. Note 
that the two-party inequalities (2.20) are a special case of the more general 
multiparty result. 

2.2 Quantum information theory 

The fundamental ideas of quantum information theory are analogous to 
those of classical information theory. In addition to the classical sources and 
channels, we simply introduce a new set of fundamental building blocks in our 
studies. These quantum resources governed by the laws of quantum mechan- 
ics can exhibit strange and non-intuitive behaviour but can nevertheless be 
studied with the techniques of information theory. 

2.2.1 Quantum states 

The fundamental principles of quantum mechanics are simple enough to 
be explained in the space available on the back of an envelope, but to truly un- 
derstand the implications of these principles takes years of training and effort. 
We assume the reader is familiar with basic notions of quantum mechanics 
[35, 27]. This section will focus on specific notions and notation that are used 
in quantum information theory. 

We will denote quantum systems by uppercase roman letters like A, B, R 
and the corresponding Hilbert spaces as H A ,H B ,H R with respective dimen- 
sions d,A,dB,dR. We denote pure states of the system A by kets: \ip) A and 
density matrices as (p A . Because of the probabilistic interpretation of quantum 
mechanics, all kets have unit norm and all density matrices are positive and 
with unit trace. We will refer to both kets and density matrices as states. 

We use the partial trace operator to model partial knowledge of a state. 
Given a bipartite state p AB shared between Alice and Bob, we say that Alice 
holds in her lab the reduced density matrix: p A = Tr B p AB , where Tr s denotes 
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a partial trace over Bob's degrees of freedom. In general the state produced 
in this manner will be mixed - a classical probability distribution over states. 

Conversely, any mixed state a A G 2 H A can be purified to a fictitious 
larger Hilbert space. That is, we imagine a corresponding pure state \a) G 
H. A ®'H R such that taking the partial trace over the R system gives the original 
state: Tr R (\<y)(<y\ AR ) = cr A . The purification procedure is often referred to as 
escaping to the Church of the larger Hilbert space in literature. 

2.2.2 von Neumann entropy 

Analogously to classical information theory, we quantify the information 
content of quantum systems by using an entropy function. 
Definition 2.9 (von Neumann Entropy). Given the density matrix p A G 7i A , 
the expression 

H{A) p = -Tx{p A \ogp A ) (2.24) 

is known as the von Neumann entropy of the state p A . 

Certain texts use the alternate notation S(A) p for the von Neumann en- 
tropy to distinguish it from the classical Shanon entropy, but we choose not 
to make this distinction here. This overloading of notation is warranted since 
the von Neumann entropy is in fact the Shannon entropy of the eigenvalues of 
the state. Given the spectral decomposition of the state p A = J2i K\ e i){ e i\, we 
can calculate H(A) p = — Tr [p A \ogp A ) = — J^AjlogA;. The von Neumann 
entropy of a pure state is zero, since it has only a single eigenvalue. 



2 Strictly speaking, we should say a A G D(H A ) where D(H A ) is the set of 
density matrices over H A . We will use this economy of notation consistently. 
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For bipartite states p AB we can also define the quantum conditional en- 
tropy 

H(A\B) p := H{AB) P - H{B) P (2.25) 

where H{B) P = — Tr (p^log/r 8 ) is the entropy of the reduced density matrix 
p B = Tta (p AB ) • In the same fashion we can also define the quantum mutual 
information information 

I(A; B) p := H(A) p + H(B) p - H(AB) p (2.26) 

and in the case of a tripartite system p ABC we define the conditional mutual 
information as 

I(A;B\C) P := H(A\C) P + H(B\C) P -H(AB\C) P (2.27) 
= H(AC) P + H(BC) P -H(ABC) P -H(C) P . (2.28) 

It can be shown that I(A; B\C) is strictly non negative for any state p ABC . 
The formula I (A; B\C) > can also be written in the form 

H(AC) + H{BC) > H{C) + H(ABC). (2.29) 

This inequality, originally proved in [36], is called the strong subadditivity of 
von Neumann entropy and forms an important building block of quantum 
information theory. 

On the surface, it may appear to the reader that quantum information 
theory has nothing new to offer except a rewriting of the classical formulas in a 
new context. This observation is highly misleading. We present the following 
example to illustrate some of the new aspects of quantum information theory. 
Example 2.10. Consider the $ + Bell state 

\§) AB = i (\m) AB + \ll) AB ). (2.30) 
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This state exhibits a form of quantum correlation called entanglement that 
is fundamentally different from classical correlation. The associated density 
matrix is $ AB = |$)(<l ) | j4 ' B 7 which has the reduced density matrices & A = $ s = 
|(|0)(0| + |1)(1|). 

Next we calculate the entropy of the two subsystems A, B and the system 
as a whole 

H(A)<s> = 1, H(B)z = 1, H(AB)<s, = 0, (2.31) 

since & A , & B are maximally mixed and \<&) AB is pure. Using these results, it 
is now simple to calculate the conditional entropy 

H(A\B) = H(AB) - H{B) = -1 [bits], (2.32) 

and the mutual information 

I (A; B) = H(A) + H{B) - H(AB) = 2 [bits]. (2.33) 

Equation (2.32) illustrates one of the key differences between classical in- 
formation theory and quantum information theory: the fact that conditional 
entropy can be negative. How can we interpret negative values as uncertain- 
ties? Also, it is not immediately clear what we mean by conditioning on a 
quantum system in the first place. These issues will be discussed in some 
detail in Section 3.3 where we will give the conditional entropy an operational 
interpretation. 

In classical information theory, the mutual information between two bi- 
nary sources attains its maximal value of 1 when the two sources are perfectly 
correlated. As we can see from equation (2.33), in the quantum world two 
qubits can be, in some sense, more than perfectly correlated and have mutual 
information as much as 2 bits! 
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2.2.3 Quantum resources 

The current trend in quantum information theory is to look at communi- 
cation tasks as inter-conversions between clearly defined information resources. 
To render the resource picture generic, we always imagine a scenario in which 
two localized parties, usually called Alice and Bob, want to perform a certain 
communication task. Local computation will be regarded as free of cost in 
order to focus on the communication aspects of the task. 

An example of a classical communication resource is the noiseless channel 
from Alice to Bob, denoted [c — > c]. The symbol [c — > c] represents the ability 
to send one bit of information from Alice to Bob. A related classical resources 
is the noisy channel, denoted {c — > c} which is usually modeled as a mapping 
M X ^ Y , described by a conditional probability p(Y = y\X = x) where X is the 
input variable sent by Alice and Y the random variable received by Bob. The 
noiseless channel [c — > c] is, therefore, a special case of the general channel 
{c — > c} with the identity mapping M = \ X ^ Y from X to Y. Another classical 
resource denoted [cc] represents a random bit shared between Alice and Bob. 

Quantum information theory introduces a new set of resources. In analogy 
to the classical case, we have the noiseless quantum channel [q — > q] which 
represents the ability to transfers one qubit, a generic two dimensional quantum 
system, from Alice to Bob. A noisy quantum channel, {q — > q}, is modeled 
by a mapping Af A ~* B which takes density matrices in Ti A to density matrices 
in 7i B . The mapping Af is a quantum operation: a completely positive trace 
preserving (CPTP) operator [27]. 

One key new resource of quantum information theory is the maximally 
entangled state shared between Alice and Bob 




(2.34) 
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which we denote [qq\. Note that, since local operations are allowed for free 
in our formalism, any state |$') AB = U A ®U B \&) AB where U A ,U B are lo- 
cal unitary operations is equivalent to |$) B . Entanglement is a fundamental 
quantum resource because it cannot be generated by local operations and clas- 
sical communication (LOCC). The precise characterization of entanglement 
has been a great focal point of research in the last decade. For an in depth 
review of the subject we refer the readers to the excellent papers [20, 37]. 

Entanglement forms a crucial building block for quantum information 
theory because it can be used to perform or assist with many communication 
tasks. In particular, two of the first quantum protocols that ever appeared 
involve ebits, or entangled bits. The quantum teleportation protocol [38] uses 
entanglement and two bits of classical communication to send a quantum state 
from Alice to Bob 

[qq} + 2[c^c] > [q - q], (TP) 

while the superdense coding protocol [39] uses entanglement to send two clas- 
sical bits of information with only a single use of a quantum channel 

[qq] + [q^q] > 2[c->c]. (SC) 

The above resource inequalities indicate that the resources on the left hand 
side can be used to simulate the resource on the right hand side. 

The two protocols (TP) and (SC) are only the tip of the iceberg: there are 
many more protocols and fundamental results in quantum information theory 
that can be written as resource inequalities. In Section 3.2 we will introduce 
some of them and the relationships that exist between them. 
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2.2.4 Distance measures 

In order to describe the "distance" between two quantum states we use 
the notions of trace distance and fidelity. The trace distance between quantum 
states a and p is 

TD(p, a) :— \\p — a\\i — Tr\p — a\ (2.35) 

where \X\ = VX^X. 

The fidelity between two pure states is simply the square of their inner 
product 

F(\v),m = \(m\ 2 - (2.36) 

The most natural generalization of this notion to mixed states p, a is the 
formula 

F(p, ( r)=Tr(^/ y /pa y /p^ . (2.37) 

Two states that are very similar have fidelity close to 1 whereas states with 
little similarity will have low fidelity. 

Note that some texts, (ex: [27]) define the trace distance with an extra 
normalization factor of \ and write the fidelity without the square. These 
differences of convention do not affect any of our findings but are important 
to point out to avoid confusion. 

The trace distance and fidelity measures are related, that is if two states 
p and a are close in one measure they are also close in the other [40]. More 
precisely, the quantities TD and F satisfy the following inequalities 

1-VF < \TD < y/l-F, (2.38) 
l-TD < F < 1-^. (2.39) 

Thus, if for certain states F > 1 - e, then TD < 2y/e. Also, if TD < e, then 
F > 1 - e. 
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2.2.5 Ensemble and entanglement fidelity 

The concept of an identical, independently distributed (i.i.d.) source also 
exists in quantum information theory. However, there are a number of ways 
we can adapt the concept to the quantum setting so some clarifications are in 
order. 

An ensemble S = {p i: \ipi)} is a set of quantum states which occur 
with probability Pi. One way to describe a quantum source is to specify the 
states \ipi) and the corresponding probabilities Pi associated with this source. 
Using this ensemble characterization we can specify what it means to success- 
fully perform a communication protocol with that source. Let Af A ^ A with 
input \ip) A E H A and output a A G 7i A be the quantum operation associated 
with the protocol: 

N{\m\)=° A - (2-40) 

To measure how faithfully the input state has been reproduced at the output 
we calculate the input-output fidelity F{\ip) A , a A ). In order to measure how 
faithfully the source as a whole is reproduced at the output, we have to average 
over the input-output fidelities of the ensemble 

F(£,M) :=^r Pi F(|^>,a,), a t = N{\A)(A\)- (2.41) 

i 

If we want the source to be preserved perfectly then we require F(£,Af) = 1. 
In general, however, we will be content with approximate transmission where 

F(£,Af)>l-e (2.42) 

for arbitrary small e. It turns out that this way of describing the source may 
not be practical or desirable since it requires a detailed knowledge of the inner 
workings of the source — something that is often impossible to obtain even in 
theory. 
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The better way to describe a quantum source is specify only the aver- 
age density operator p = ^Pil^X^! f° r that source. This characterization 
could be obtained through state tomography [27] and does not presuppose any 
knowledge of the ensemble which generates p. This description is more gen- 
eral because the results we obtain for the density matrix p will hold for all 
ensembles {p h |^)} such that p = J^PilV'iXV'il- 

This also leads us to an alternative and simpler way of judging the success 
of a quantum protocol that relies on the idea of the Church of the larger Hilbert 
space. Let l^)^ be a purification of p A to some reference system R. This 
reference system is entirely fiducial and does not participate in the protocol. 
In the larger Hilbert space H A ® H R the M A ^ A operation acts as 



M A ^%l R 



AR 



) = a AR . 



(2.43) 



The operation is shown as a quantum circuit in Figure 2-3. 

R 




> o 



AR 
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M A ^ A 
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Figure 2-3: A quantum circuit which shows M acting on the A system while 
the reference, R, is left unperturbed. 

For approximate transmission, we now require the fidelity between the 
pure input state \ip) AR and the possibly mixed output state a AR to be high 



Fm AR ,a AR ) = 



AR\ a AR^ARj > 1-e 



(2.44) 



Equation (2.44) measures the entanglement fidelity of the operation: how well 
the protocol manages to transfers the i?-entanglement from the A system to 
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the A system. It can be shown [41] that if the channel M has high entanglement 
fidelity then the average fidelity F(£,Af) will also be high for any ensemble S 
such that p A = X^Pil^iX^il- m other words, equation (2.44) implies equation 
(2.42). The entanglement fidelity paradigm has the advantage that the input 
state to the protocol is pure, which makes our analysis much simpler. Also, in 
this paradigm we are certain that any correlations the A system might have 
with other systems are preserved because of monogamy of entanglement. 

In the i.i.d. setting, we operate simultneously on n copies of the same 
input state p A . We denote the tensor product of all the input states by p A " = 
p A <8> • • • <E> p A (n-copies). The quantum operation becomes J\f An ~^ An and the 
output state will be a A " . The entanglement fidelity 

F{\i,) AnR \a AnRn ) = (^ AnRn \ a AnRn \^ AnRn ) > 1 - e(n), (2.45) 

is now a function of n, the block size of the protocol. Thus, in the i.i.d. setting 
we say the protocol implemented by Af succeeds when e(n) — > as n — > oo. 
More formally, for any required precision eo, there exists an N(e) such that 
for all n > N(e), there exist n-dependent maps Af such that e(n) < e in 
equation (2.45). 



CHAPTER 3 
Results in quantum information theory 

This chapter is dedicated to four landmark results in quantum information 
theory. The first of these is Schumacher compression, the quantum version of 
source coding [42]. The second is the resource framework of quantum informa- 
tion theory [4], which defines rigorously the properties of quantum protocols 
and discusses the relationships between them [3]. Then, in section 3.3, we focus 
our attention on one protocol for compression of quantum information with 
side information known as state merging [15, 43]. Finally, in the last section 
of this chapter, we discuss in detail the fully quantum Slepian-Wolf (FQSW) 
protocol for state transfer and simultaneous entanglement distillation. To a 
large extent, the multiparty results in this thesis are a direct generalization of 
the two-party FQSW protocol, therefore, section 3.4 is of central importance 
to the remainder of the argument. 

3.1 Schumacher compression 

If classical information theory is 60 years old [28] then quantum informa- 
tion theory must be 12 years old. Indeed, we can say that Schumacher laid 
the foundations of quantum information theory with his 1995 paper [42] where 
he showed that the von Neumann entropy, H(p), plays the analogous role of 
Shannon entropy for quantum systems. Namely, it has operational interpre- 
tation as the number of qubits necessary to convey the information from a 
quantum source p. 
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3.1.1 Typical subspace 

The notion of a typical set (section 2.1.3) can easily be generalized to the 
quantum setting. Consider a source which produces many copies of the state 
p A which has spectral decomposition p A = X)i I 

In the the i.i.d. regime, the state produced by the source is given by 
p A " = p Al ®p A2 ® • ■ • ®p An which can be written as 



(3.1) 



p An =5^A il ---A in \h){h\ Al ®- ■ -®\in)(in\ An 

We now define the typical projector as follows 

n^ n) = liiXiil^l^X^I^.-linXinl^ 

= Yl \nn An , (3.2) 

where we sum over all the typical sequences with respect to the classical 
probability distribution p(i) := Aj. 

We call the support of the typical subspace of 7i An associated with p A . 
The typical subspace, by its construction, inherits the characteristics of the 
typical set. Indeed, 11^ has the following properties 



(i) Tr 

(ii) Tr 



> 1 — 5 V5, e > and n sufficiently large. 
< 2 n W A ^ +e ^ Ve > and n sufficiently large. 

Property (i) says that, for large n, most of the states produced by the 
source will lie mostly inside the typical subspace. Property (ii) is a bound on 
the size of the typical subspace which follows from the classical bound on the 
size of the typical set ■ These two properties are at the heart of our ability 
to compress quantum information. 
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3.1.2 Quantum compression 

Analogously to the classical case, we have the notion of a quantum com- 
pression rate. In the quantum regime, we use the entanglement fidelity (see 
Section 2.2.5) to measure how well the state is reproduced after decoding. 
Definition 3.1 (Quantum compression rate). We say a compression rate R 
for the source p A is achievable if for all e, there exists N(e) such that for 
n > N(e), there exist maps: 

E n : H A " — > M \M\ =2 nR (3.3) 
V n :M -> H An (3.4) 

such that the purification R of p A " satisfies 

Fm AnR " , a AnRn ) = (^| a AnRn \^) AnRn > 1 - 6. (3.5) 

where a AnRn = V n o£ n ®l Rn (|^)(^D- 

Theorem 3.2 (Schumacher noiseless coding). An i.i.d. quantum source p A 
can be compressed at a rate R if R> H(A) p and cannot if R < H{A) p . 

The idea behind the Schumacher compression result is simple. We encode 
by performing the measurement 

M e = {nW, 1-ITf)}. (3.6) 

If ni™' ) occurs, we keep this state since it is typical. Otherwise, if (1 — n^) 
occurs, we replace the state with some fixed state |err) as an indicator that 
an error has occurred. The decoding operation T> n is the identity operation. 
Property (i) from the previous section guarantees that the probability of error 
tends to zero when n becomes large. Also, since we only send states within the 
typical subspace, Property (ii) gives us a bound on the amount of quantum 
information necessary to convey this state. 
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Note that the compression protocol described above works both for sce- 
narios where the mixed state is obtained from a stochastic average over pure 
states p A = ^2iPi\i^i){i^i\ A and scenarios where the density matrix is part of a 
larger pure state p A = Tr^X^!^. 

3.2 Quantum protocols as resource inequalities 

Most of the old results of quantum information theory form a loose col- 
lection of coding theorems, each of them with applications only to one specific 
communication task. Recently, there has been a push to organize these results 
into a unified framework of resource inequalities [2, 3, 4, 5]. A resource inequal- 
ity is a quantitative statement regarding inter-conversions between clearly de- 
fined generic information resources. The key benefit of such a framework is 
that, like Lego blocks, we can build one communication protocol based on 
another, and generally work at a higher level of abstraction then is possible 
when working with the specifics of each protocol. 

3.2.1 The framework 

A unified framework for both classical and quantum information theory 
was developed in [4]. The notions "resource" and "protocol" are clearly de- 
fined as well as the rules for combining and composing them. In particular, 
this framework deals with the class of bipartite, unidirectional communication 
tasks involving memoryless channels and sources in the i.i.d regime. 

Borrowing from the cryptography heritage, the two main participants 
in the protocols are called Alice and Bob. Alice is usually the sender, and 
performs some encoding operation while Bob does the decoding. Additionally, 
the framework introduces two novel participants Eve and the Reference. We 
use Eve to model information lost to the environment in a noisy channel. The 
reference R is a fiducial purification system which allows us to deal with mixed 
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states in a simple manner as discussed in Section 2.2.5. Most important of 
all, the framework introduces the Source, which produces some state p s and 
distributes it to the participants before the beginning of the protocol. 

In Section 2.2.3, we introduced some of the resources of information theory 
like the noiseless classical channel [c — > c], noisy classical channel {c — > c} and 
the quantum equivalents [q — > g] and {q — > g}. In order to be more precise, 
we sometimes use a different notation for noisy channels 

{q^q}= (Af) (3.7) 

which explicitly shows the map Af associated with that channel. Note that 
the angle brackets (.) indicate that we are working in the asymptotic regime 
of many copies of the resource: (A/") ~ Af® n and (p AB ) ~ (p AB )® n . We will 
denote a relative resource as (N : p A ) which is a channel guaranteed to behave 
as the channel Af provided the input state is exactly p A . 

As a first example of the protocol framework, consider the Schumacher 
compression result from the previous section. It can be represented by the 
following resource inequality 

(H(B) a + S) -> q] > (1 A ^ B : p A ) (3.8) 

for any 5 > and where a B := l A ^ B (p A ). The above equation indicates that 
(H(B)+S) qubits are sufficient to accurately convey the information contained 
in the state p A to another party. 

3.2.2 The family of quantum protocols 

Many protocols of quantum information theory deal with the conversion 
of some noisy resource into the corresponding noiseless version possibly with 
the use of some auxiliary resources. It turns out that many of these protocols 
are related and it is sufficient to prove two protocols of this type and all other 
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protocols follow as simple consequences when we apply teleportation (TP) or 
superdense coding (SC) either before or after these protocols [3]. 

The two protocols which generate all the others of this "family tree" 
are called the mother and father protocols. The mother protocol takes the 
static resource (p AB ) and some quantum communication to distill maximally 
entangled bits. The resource inequality is 

(P AB ) + R)AQ - Q] > \l{A\ BUqq], (?) 

where the entropies are taken with respect to a purification \tp) ABR of p AB . 
The mother protocol can be used to derive three "children" . The first of these 
is entanglement distillation, also known as the hashing inequality [44]. We 
start with equation (5) but implement the [q — > q] term as teleportation 

h{A- R) {[qq] + 2[c - c] > [q - q]j . (3.9) 

After canceling some of the [qq] terms on both sides we obtain 

(p AB ) + I (A; RUc ^c]> I c {A)B)^qq], (3.10) 

where I C (A)B) = \l(A;B) - \l{A;R) = H(B) - H(AB). The mother in- 
equality can also be used to derive noisy versions of the teleportation [3] and 
superdense coding protocols [45]. 

The father protocol takes the dynamic resource of a noisy quantum chan- 
nel (J\f A ^ B ) and some additional entanglement to simulate a noiseless quantum 
channel. Consider a setup where we send half of the state \(f) AR through the 
channel Af, which we model as an isometric extension Ufif^ BE to an environ- 
ment E. The resulting state is \ip) BER = Uj^ BE ®l R \4>) AR and the resource 
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inequality is 

(N A ^ B ) + ±J(i2; > ^(«; - 9]. (tf) 

Using the father inequality and the superdense coding result (SC), we can de- 
rive the formula for the entanglement-assisted classical capacity of a quantum 
channel [8] 

(Af A - B : p A ) + H(R)4qq} > I(R; B)^[c -> c\. (3.11) 

More importantly, we can obtain the important quantum capacity result, the 
LSD Theorem [46, 47, 9], named after Lloyd, Shor and Devetak: 

(N) > I c (R)B)^[q -> q\. (3.12) 

Furthermore, it turns out that equation (5) and (cf) are related. We 
can obtain one from the other by replacing dynamic resources with static 
resources and adjusting for the definitions of A and R. This duality could 
be a mere coincidence or it could be indicative of some hidden structure. We 
will see in section 3.4 that in fact there exists an even bigger mother! The 
FQSW protocol, sometimes called "the mother of all protocols" , is a quantum 
protocol that generates both the mother and father protocols as well as many 
other protocols that were not part of the original family tree [5, 48]. 

3.3 State merging 

Consider a setup where Alice and Bob share the state p AB = Tr R |'0)('0| AB - R . 
We would like to know how much quantum information Alice needs to send 
to Bob to merge her part of the state into Bob's. The problem is illustrated 
graphically in Figure 3-1. 
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Figure 3-1: Pictorial representation of the state merging protocol. Alice's part 
of \ip) ABR is merged with Bob's part. In the end, the purification of R is held 
entirely in Bob's system. 

In the limit of many copies of the state, the rate at which Alice needs to 
send quantum information to Bob is given by the formula 



provided classical communication is available for free. The primitive which 
optimally achieves this task is called the state merging protocol [15, 43]. We 
will discuss this protocol in some detail in section 3.3.2 but before that we 
dedicate some time to the quantum conditional entropy. 

3.3.1 Quantum conditional entropy 

The classical notion of conditional entropy H(X\Y) is the amount of 
communication needed to convey the information content of the source X 
given knowledge of the variable Y at the decoder. As we saw in section 2.1.6, 
the conditional entropy is naturally suited to application in the Slepian-Wolf 
problem of distributed compression. 

When we try to adapt the conditional entropy to the quantum world, we 
run into a number of conceptual difficulties. Indeed, in order to define H(A\B) p 
for a quantum state p AB we need to replace the classical notion of a conditional 
distribution with some concept better suited to density matrices [49, 50]. A 
more pragmatic approach is to simply mimic the form of equation (2.16) from 
Section 2.1.5 and write the conditional entropy as a difference of two regular 



R > H(A\B) 



(3.13) 
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entropies 

H{A\B) p := H(AB) p - H{B) p . (3.14) 

In this way, we obtain a formula for the conditional entropy but still lack 
an interpretation. The situation is complicated further by the fact that the 
quantum conditional entropy can take on negative values seemingly indicating 
that it is possible to know more about the global state than about a part of 
it! Also to be explained is the relation between the negative values of the 
conditional entropy and the presence of quantum entanglement as indicated 
by the entropic Bell inequalities [51]. 

The interpretation issues around the quantum conditional entropy were fi- 
nally settled in a satisfactory manner in two recent papers [15, 43], in which the 
quantum conditional entropy is given an operational interpretation in terms 
of the state merging protocol. 

3.3.2 The state merging protocol 

Consider a state p AB shared between Alice and Bob and a purification 
of that state \ip) ABR . We want to send Alice's part of the state to Bob by 
using an unlimited amount of classical communication and as little quantum 
communication as possible. Let $k e H. a ° Bo ,&l € H AlBl be two maximally 
entangled states of rank K and L respectively. The state merging protocol 
takes as inputs the state | \ ABR and log if ebits in the form of §k and 
applies the quantum operation M. : AA <S> BB — > A\ <g> B\BB to produce a 
state 

a A ^ BBR = (M®l R ){\^M ABR ®$ K ). (3.15) 

We want the state p AB to be transferred entirely to Bob's lab: a BB ps p AB . 
In addition, logL ebits are generated by the protocol if a AlBl <& l . More 
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precisely, we measure the success of the protocol by the entanglement fidelity 

F (a MBl8BR , <g) \^M ABR ) > 1 - e. (3.16) 

In our original formulation of the state transfer task we asked how much 
quantum communication from Alice to Bob is necessary, yet in the above for- 
mulation we only speak of entanglement being consumed and generated. This 
is so because the two resources become equivalent when unlimited classical 
communication is allowed: 

[qq] = [q — > q] (free classical communication). (3.17) 

Thus, we can say that § K is the entanglement consumed by the protocol 
while $l is the entanglement generated. In the i.i.d. regime, where \ip) ABR = 
(\(fi) ABR ^ , we define the entanglement rate 

i2= -(logtf-logL), (3.18) 
n 

which can take on both positive and negative values. When R > 0, the 
entanglement resource has been consumed by the protocol, but when R < 
the protocol is actually generating entanglement as stated in the following 
theorem. 

Theorem 3.3 (Quantum state merging [43]). For a state p AB shared by Alice 
and Bob, the entanglement cost of merging is equal to the quantum conditional 
entropy H (A\B) = H(AB)—H(B). When H(A\B) is positive, merging is pos- 
sible only if R > H(A\B) ebits per input copy are provided. When H{A\B) 
is negative, the merging is possible by local operations and classical communi- 
cation and moreover, R < —H{A\B) maximally entangled states are obtained 
per input copy. 
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We can express the state merging protocol as a resource inequality 

(U s ^ AB :p s ) + H(A\BUq^q] > (1 S ^ B : p s ) (free [c <- c}) (3.19) 

where JJ S ~* AB is an isometry, p AB = U s ~* AB (p s ) that splits the state produced 
by the source between Alice & Bob while 1 S ^ B gives the state directly to Bob. 
The net effect of (JJ S ^ AB : p s ) n the left hand side and (1 S ^ B : p s ) on the 
right, is the state transfer resource informally defined 

(1 A ^ B : p AB ) := (1 S ^ B : p s ) - (U S ^ AB : p s ). (3.20) 

According to equation (3.19), this resource can be an asset or a liability de- 
pending on the sign of H(A\B). 

The state merging protocol has numerous applications. It can be used 
to study the quantum capacity of multiple access channels, entanglement 
distillation [11], entanglement of assistance [52] and distributed compression. 
The latter of these is of particular relevance to the subject of this thesis since 
it is a quantum generalization of the Slepian-Wolf problem discussed in sec- 
tion 2.1.6. Alice and Bob have to individually compress their shares of a 
state p AB and transmit them to common receiver, Charlie. We allow unlim- 
ited classical communication and rates Ra, Rb of quantum communication to 
Charlie. The rate region for quantum distributed compression is given by the 
inequalities 

R A > H(A\B) p , 

R B > H(B\A) p , (3.21) 
R A + R B > H{AB) P . 

The rates for quantum distributed compression (3.21) should be compared 
with the classical distributed compression rates (2.20). This is an instance of 
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a general trend in quantum information theory: if classical communication is 
available for free, the solution to the quantum analogue of a given classical 
communication task is identical the classical solution up to replacement of 
Shannon entropies by von Neumann entropies. Many times, however, this "if 
goes to S rule" is only skin deep and sometimes it does not hold at all. 

In the next section we will give the details of the fully quantum Slepian- 
Wolf (FQSW) protocol, which is a generalization of state merging where no 
classical communication is allowed. In the light of this, a detailed proof of the 
state merging protocol has been omitted for the sake of brevity and since it 
follows from the more powerful FQSW protocol. 

3.4 The fully quantum Slepian-Wolf protocol 

The fully quantum Slepian-Wolf protocol [5] is a procedure for simultane- 
ous quantum state transfer and entanglement distillation. It can be thought 
of as the quantum version of the classical Slepian-Wolf protocol but, unlike 
the state merging protocol considered above, no classical communication is 
allowed. This FQSW protocol generates nearly all the other protocols of quan- 
tum information theory as special cases, yet despite its powerful applications 
it is fairly simple to implement. 



R A R A 2 




Figure 3-2: Diagram representing the ABR correlations before and after the 
FQSW protocol. Alice manages to decouple completely from the reference R. 
The B system is isomorphic to the original AB: it is the purification of R. 

The state \t/j) ABR = (\<p) ABR ^ is shared between Alice, Bob and a 
reference system R. The FQSW protocol describes a procedure for Alice to 
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transfer her i?-entanglement to Bob while at the same time generating ebits 
with him. Alice can accomplish this by encoding and sending part of her 
system, denoted A±, to Bob. The state after the protocol can approximately be 
written as \ $) A2B (\(p) RB )® n , where the systems B and B are held in Bob's lab 
while A 2 remains with Alice. The additional product, |$) A2B , is a maximally 
entangled state shared between Alice and Bob. Figure 3-2 illustrates the 
entanglement structure before and after the protocol. 

3.4.1 The protocol 

The protocol relies on an initial compression step and the mixing effect of 
random unitary operations for the encoding. We assume that, before the start 
of the protocol, Alice and Bob have pre-chosen a random unitary operation U a- 
Equivalently, they could have shared random bits which they use to locally 
generate the same unitary operation. 

The protocol, represented graphically in Figure 3-3, consists of the fol- 
lowing steps: 

1. Alice performs Schumacher compression on her system A to obtain the 
output system A s . 

2. Alice then applies a random unitary Ua to A s . 

3. Next, she splits her system into two parts: A±A 2 = A s with dA 1 = 2 nQA 
and 

Q A >h(A;K) ip . (3.22) 

She sends the system A 1 to Bob. 

4. Bob, in turn, performs a decoding operation V^ lB ^ BB which splits his 
system into a B part purifying R and a B part which is fully entangled 
with Alice. 
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Figure 3-3: Circuit diagram for the FQSW protocol. First we Schumacher 
compress the A system, then we apply the random unitary encoding U a- At 
the receiving end Bob applies a decoding operation V. 

The best way to understand the mechanism behind this protocol is by 
thinking about destroying correlations. If, at the end of the protocol, Alice's 
system A 2 is nearly decoupled from the reference in the sense that a A2R ~ 
a A2 <S> cr R , then Alice must have succeeded in sending her R entanglement 
to Bob because it is Bob alone who then holds the R purification. We can 
therefore guess the lower bound on how many qubits Alice will have to send 
before she can decouple from the reference. Originally, Alice and R share 
I(A; R)^ bits of information per copy of \(p) ABR . Since one qubit can carry 
away at most two bits of quantum mutual information, this means that the 
minimum rate at which Alice must send qubits to Bob is 



Q A >-I(A;R) ip . 



(3.23) 



It is shown in [5] that this rate is achievable in the limit of many copies of 
the state. Therefore the FQSW protocol is optimal for the state transfer task. 
More formally the decoupling process is described by the following theorem: 

Theorem 3.4 (One-shot decoupling theorem from [5]). 

Let a MR (U) = Tr Al [(U <g> l R )ip ASR {W <g> 1 R )] be the state remaining on A 2 R 
after the unitary transformation U has been applied to A s = A\A 2 . Then 



V(A) 



a 



A 2 R 



(u)- 



d 



a 



2 dU<^Tr[ 
i di. 



\A S R\2} 



(3.24) 
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This theorem quantifies how close to decoupled the A 2 and R systems are 
if a random unitary operation is applied to the A s = AiA 2 system. There 
are several important observations to make in relation to the above inequality. 
First, we note that for a given state \^jj) ABR , the dimensions of the systems A s 
and R as well as the purity Tr[(^j A ' SR ) 2 } are fixed numbers over which Alice 
has no control. Alice can, however, choose the dimension of the subsystem she 
sends to Bob, d^, and influence how decoupled she is from the reference. By 
making making d,A 1 sufficiently large, Alice can thus make the right hand side 
of (3.24) tend to zero. 

Second, the fact that Alice holds something very close to a maximally 
mixed state 1 / cIa 2 indicates that Bob can, by an appropriate choice of decoding 
operation Vb, establish a maximally entangled state \&) A2B with Alice. These 
ebits generated between Alice and Bob are a useful side-effect of the protocol 
that is similar to the entanglement generated by the state merging protocol. 

All that remains now is to specify d,A 1 , the dimension of the system sent 
to Bob, in terms of entropic quantities of the input state. This can be done in 
the the limit where n, the number of copies of the state goes to infinity. Using 
the properties of typical subspaces, we can we can make the right hand side 
of equation (3.24) tend to zero provided the rate Qa = ^log^Ai satisfies [5]: 

Q A >±I(A;R) v + 6 (3.25) 

for any 5 > 0. 

3.4.2 The FQSW resource inequality 

In the spirit of section 3.2 above, we can succinctly express the effects of 
the fully-quantum Slepian-Wolf protocol as a resource inequality 

{U s^ab :(p s )+ i I(A . R)Aq ^ q] > i /(A;5 y gg] + (1 ^ : ^ {3 26) 
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which is read: Given the state \(p) ABR and ^I(A;R) qubits of communication 
from Alice to Bob we can obtain the state \p) RB while also purifying \l{A; B) 
ebits. 

As previously announced, the FQSW protocol is more powerful than the 
state merging protocol of section 3.3 since it generates it as a special case. 
Indeed, when we implement the quantum communication [q — > q] of equa- 
tion (3.26) as teleportation according to equation (TP) 

±I(A;R)[qq] + I(A;R)[c^c] > \l (A; R)[q - q}. (3.27) 

We now "recycle" the entanglement produced by the protocol. The factor in 
front of [qq] is is going to be \I{A; R) — \I{A; B) = H(A\B) and the overall 
resource inequality becomes 

(U S ^ AB :<p s )+H(A\B) ip [qq}+I(A;R) v> [c^c} > (1 S ^ B : </>, (3.28) 

which is exactly the state merging resource inequality (3.19), when we also 
account for the classical communication cost. 

The FQSW inequality generates the mother inequality (5) by discarding 
the additional resource (1 S ~* B : ip s ) on the right hand side. Moreover it was 
recently shown that by the source- channel duality, the FQSW protocol can 
be used to generate the father protocol (d) and by time reversal duality the 
FQSW protocol leads to the fully quantum reverse Shannon (FQRS) proto- 
col [48]. Other notable results related to the FQSW protocol are the recent 
results for broadcast channels [53], and the generalization of the FQSW task 
called quantum state redistribution, which uses side information both at the 
encoder and the decoder [54, 55]. 
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In addition to the its powerful protocol generating faculties, the FQSW 
protocol has applications to the distributed compression problem for quan- 
tum systems. Indeed, the original FQSW paper [5] partially solves the dis- 
tributed compression problem in the two-party case by providing upper and 
lower bounds on the set of achievable rates. In Chapter 5 we will present our 
results on the multiparty version of the same problem. For the sake of continu- 
ity, the reader may wish to skip Chapter 4 on a first reading of the thesis since it 
is a self-contained exposition on the multiparty squashed entanglement, which 
only comes into play relatively late in the distributed compression chapter. 



CHAPTER 4 
Multiparty quantum information 

Many of the protocols of information theory deal with multiple senders 
and multiple receivers. As a whole, however, network information theory, 
the field which studies general multiparty communication scenarios is not yet 
fully developed even for classical systems [26] . Quantum network information 
theory, which deals with quantum multipartite communication, is also under 
active development [56, 57, 25] and, thanks to the no-cloning properties of 
quantum information, sometimes admits simple solutions [56]. On the other 
hand, a full understanding of quantum network theory will require a precise 
characterization of multiparty entanglement, a task which is far from com- 
pleted [21, 22, 23, 24]. Nevertheless, we can hope that years from now we will 
have a rigorous and complete theory of multiparty information theory in the 
spirit of the two-party protocols framework [4]. 

One step toward the development of a multiparty information theory 
would be to generalize the concept of mutual information I(A; B) to more 
than two parties. The mutual information, the information that two systems 
A and B have in common, can be written as 

I{A-B) = H{A)-H{A\B). (4.1) 

The above formula is interpreted as a reduction of the total uncertainty of A 
by the amount that is not common to B. What is left is the uncertainty that 
is shared. 
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Another way to write the mutual information is 



I(A; B) = H(A) + H(B) - H(AB), 



(4.2) 



which adds both entropies (double-counting the entropy that is common) and 
then subtracts the total entropy. Equation (4.2) is a measure of how different 
from independent the variables A and B are. Both of these interpretations of 
the mutual information can be generalized to the multiparty case. 

One way to define the multiparty mutual information for three variables 
A, B and C is by mimicking the form of equation (4.1) above and define 



The motivation behind this formula is to subtract from the mutual information 
I(A; B) any terms that are due to AB-oxAy correlations and not true tripartite 
correlations. The expanded form of the restrictive mutual information is 



I n {A; B; C) = H{A) + H(B) + H{C) - H(AB) - H{AC) - H{BC) + H(ABC). 



This is the information shared by all three parties and corresponds to the 
region labeled "g" in Figure 4-1. This form of multiparty mutual informa- 
tion was defined in [50] but has not yet proved useful in applications. Also, 
I n (A; B; C) can take on negative values [16], which are difficult to interpret. 

Another approach is to define the multiparty mutual information in the 
spirit of (4.2), as the measure of how different from independent the three 
variables are 



In terms of the regions in Figure 4-1, we have I\j{A\ B;C) = d + e + / + 2g. 
This form of the mutual information is naturally connected to the relative 



I n (A;B;C):=I(A;B)-I(A;B\C). 



(4.3) 



I U (A; B; C) := H(A) + H(B) + H{C) - H(ABC). 



(4.4) 
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Figure 4-1: Entropy diagram for three parties A, B and C. 

entropy[27] and also satisfies the chain-like property 

I U (A;B;C) = I(A;B) + I(AB;C), (4.5) 

which indicates how the multiparty information is affected when we introduce 
a new system. 

In this chapter we will investigate some of the properties of the inclusive 
multiparty information I U (A; B;C), henceforth referred to simply as mutual 
information I (A; B;C). Our work on the multiparty information will also 
allow us to naturally extend the notion of squashed entanglement [17] to the 
multiparty scenario. The multiparty squashed entanglement, discussed in Sec- 
tion 4.2, turns out to be a measure of multipartite entanglement with excellent 
properties and clear and intuitive interpretation. It finds application in the 
proof of Theorem 5.3, the outer bound on the rate region for distributed com- 
pression. 
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4.1 Multiparty information 

We begin with the definition of the multiparty quantum information for 
m parties. 

Definition 4.1 (Multiparty information). Given the state p IlX2 "- Im shared 
between m systems, we define the multiparty information as: 

I(X 1 ;X 2 ;---;X m ) p := H^) + H(X 2 ) + ■ ■ ■ + H(X m ) - H^X^ ■ • • X m ) 

m 

= J2 H ( X i)p- H ( X ^--- X m) P (4-6) 

i=l 

The subadditivity inequality for quantum entropy ensures that the mul- 
tiparty information is zero if and only if p has the tensor product form p Xl (g) 
p X2 <g> • • • ® p Xm . 

The conditional version of the multiparty mutual information is obtained 
by replacing all the entropies by conditional entropies 

m 

I(X 1 ;X 2 ;---;X m \E) p := ^ #pQ|£) - H^X* ■ ■ ■ X m \E) 

i=i 

m 

= Yl H { X i E ) - H(X,X2 ■ ■ ■ X m E) - (m - l)H{E) 
i=i 

m 

= I(X 1 ;X 2 ;-- - ;X m ;E)-J2l(X i ;E). (4.7) 

i=i 

This definition of multiparty information has appeared previously in [58, 59, 
60] and more recently in [16], where many of its properties were investigated. 

Next we investigate some formal properties of the multiparty information 
which will be useful in our later analysis. 

Lemma 4.2 (Merging of multiparty information terms). Arguments of the 
multiparty information can be combined by subtracting their mutual informa- 
tion: 



I(A; B- X x - X 2 - ■ ■ ■ ■ X m ) - I {A; B) = I(AB; X ±] X 2 ; ■ ■ ■ ; X m ). (4.8) 
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Proof. This identity is a simple calculation. It is sufficient to expand the 
definitions and cancel terms. 

I(A;B;X 1 ;X 2 ;---;X m )-I(A;B) = 

= H{A) + H{B) +^H(X i ) -H(ABX 1 X 2 ■ ■ ■ X m )-H(A)-H(B)+H(AB) 
= H(AB) + H W ~ H(A, BX i X 2 ■■■Xm) 
= I(AB; Xi, X 2 ; ■ ■ ■ ; X m ). 

□ 

Discarding a subsystem inside the conditional multiparty information can- 
not lead it to increase. This property, more than any other, justifies its use as 
a measure of correlation. 

Lemma 4.3 (Monotonicity of conditional multiparty information). 

I(AB; X,- ■ --X m \E) > I (A; X,; ■ ■ ■ X m \E) (4.9) 

Proof. This follows easily from strong subadditivity of quantum entropy (SS A) . 

I(AB;X 1 ;X 2 ;...;X m \E) = 

= H(ABE) + H ( X i E ) ~ H(ABX X X 2 . . . X m E) - mH(E) 

i 

= H(ABE) + H ( x i E ) ~ H{ABX X X 2 . . . X m E) - mH(E) + 

i 

H(AE) - H(AE) + H(AX 1 X 2 . . . X m E) - H(AX 1 X 2 . . . X m E) 

N v ' N v ' 

=0 =0 

= H(AE) + H{XiE) - H{AX X X 2 ...X m )- mH(E) + 

i 

[H(ABE) + H{AX l X 2 . . . X m E) - H(AE) - E{ABX X X 2 . . . X m E)\ 

N v ' 

>0 by SSA 
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> H(AE) + H ( x i E ) ~ H{AX X X 2 . . . X m E) - mH(E) 

i 

= I(A;X 1 ;X 2 ...X m \E) 

□ 

We will now prove a multiparty information property that follows from a 
more general chain rule, but is all that we will need for applications. 
Lemma 4.4 (Chain- type Rule). 

I(AA'; X 1 ;...; X m \E) > I (A; X 1 ;...; X m \A'E) (4.10) 

Proof. 

I(AA';X 1 ;...;X m \E) = 

m 

= H(AA'E) + H ( x i E ) ~ H(AA'X 1 , ...,X m )- mH(E) 
i=i 

m 

= I (A; Xi; . . . ; X m \A'E) + ^ [H(A'E) + H(X t E) - H(E) - H(AX t E)} 
> I(A;X 1 ;...;X m \A'E). 

The inequality is true by strong subadditivity. □ 

Remark It is interesting to note that we have two very similar reduction- 
of-systems formulas derived from different perspectives. From Lemma 4.3 
(monotonicity of the multiparty information) we have that 

I(AB; X l5 . . . ; X m \E) > I (A; X x ; . . . ; X m \E), (4.11) 

but we also know from Lemma 4.4 (chain-type rule) that 

I(AB; X l5 . . . ; X m \E) > I {A; X 1 ;... ] X m \BE). (4.12) 
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The two expressions are inequivalent; one is not strictly stronger than the 
other. We use both of them depending on whether we want to keep the 
deleted system around for conditioning. 

4.2 Multiparty squashed entanglement 

Using the definition of the conditional multiparty information from the 
previous section, we can define a multiparty squashed entanglement analogous 
to the bipartite version [61, 62, 17]. The multiparty squashed entanglement 
has recently been investigated independently by Yang et al. [16]. 

Definition 4.5 (Multiparty squashed entanglement). Consider the density 
matrix p x ± x 2--- x ™ shared between m parties. We define the multiparty squashed 
entanglement in the following manner 



where the minimization happens over all states of the form p x ^ x z---X m E suc ^ 
that Ti E (p XlX2 - XmE ) = p Xll2 '- Xm . (We say p is an extension of p.) 

The dimension of the extension system E can be arbitrarily large, which 
is in part what makes calculations of the squashed entanglement very difficult 
except for simple systems. The motivation behind this definition is that we can 
include a copy of all classical correlations inside the extension E and thereby 
eliminate them from the multiparty information by conditioning. Since it is 
impossible to copy quantum information, we know that taking the infimum 
over all possible extensions E we will be left with a measure of the purely quan- 
tum correlations. The definition of E sq as a minimization over a conditional 
mutual information is motivated by the classical cryptography notion of in- 
trinsic information which provides a bound on the secret-key rate [63, 64, 17]. 




\miI{X 1 -X 2 -----X m \E) p 

Z E 



(4.13) 
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Example: It is illustrative to calculate the squashed entanglement for sep- 
arable states, which are probabilistic mixtures of tensor products of local pure 
states. Consider the state 

p x lX2 ...x m = Y^ p .\ a .)( aj \xi 1(3^(3^ ® • • • \Q)(Q\ X -, 
j 

which we choose to extend by adding a system E containing a record of the 
index j as follows 

~x lX2 ...x mE = ^pM^a^ ® |&></^ ® • • • \Q(Q\ X ~ ® |j)0f . 
i 

When we calculate conditional entropies we notice that for any subset /C C 
{1,2,. ..m}, 

H(Xjc\E)p = 0. (4.14) 

Knowledge of the classical index leaves us with a pure product state for 
which all the relevant entropies are zero. Therefore, separable states have 
zero squashed entanglement: 

1 



E S q(Xi, X 2 ; . . . ; X m ) p — 



J2 H{Xi\E)- p - H(X 1 X 2 . . . X m \E) f 



0. 



We now turn our attention to the properties of E sq . Earlier we argued 
that the squashed entanglement measures purely quantum contributions to 
the mutual information between systems, in the sense that it is zero for all 
separable states. In this section we will show that the multiparty squashed 
entanglement cannot increase under the action of local operations and clas- 
sical communication, that is, that E sq is an LOCC-monotone. We will also 
show that E sq has other desirable properties; it is convex, subadditive and 
continuous. 
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Proposition 4.6. The quantity E sq is an entanglement monotone, i.e. it 
does not increase on average under local quantum operations and classical 
communication (LOCC). 

Proof. In order to show this we will follow the argument of [17], which in turn 
follows the approach described in [65]. We will show that E sq has the following 
two properties: 

1. Given any unilocal quantum instrument £ k (a collection of completely 
positive maps such that J2k^k * s trace preserving [66]) and any quantum 
state p x i- x ™ y then 

E sq {Xi, X 2 ; . . . X m ) p > ^2PkE sq (Xi, X 2 ; . . . X m )p k (4-15) 

k 

where 

p k = Tr £ k {p x '- Xm ) and = -E k (p x i~ x ™). (4.16) 

Pk 

2. E sq is convex. 

Without loss of generality, we assume that £ k acts on the first system. 
We will implement the quantum instrument by appending to X\ environment 
systems X[ and X" prepared in standard pure states, applying a unitary U 
on XiX[X", and then tracing out over X". We store k, the classical record 
of which £ k occurred, in the X[ system. More precisely, for any extension of 
p x lX2 -x m tQ Xl X 2 -..X m E, 

pXl x 2 ...x m E ^ p x lX[ x 2 ...x m E ;= £^I E (p x ^- x - E ) ® \k) {k\ x 'i . (4.17) 

k 
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The argument is then as follows: 



l -I{X 1 -X 2 -...X m \E) p = l -I{X l X[X';-X 2 -...-X m \E) p (4.18) 

= h{X 1 X' 1 X';-X 2 ;...-X m \E) p - (4.19) 

> l -I{X l X[-X 2 -...-X m \E) p (4.20) 

> ^I(X 1 ;X 2 ;...;X m \EX[) p (4.21) 



= lY l p k I(X 1 ;X 2 ;...;X m \E!) h (4.22) 



2 

k 

> ^p fc E sq (X i; X 2 ;...;X m )- fc (4.23) 

k 

The equality (4.18) is true because adding an uncorrelated ancilla does not 
change the entropy of the system. The transition p — > p is unitary and doesn't 
change entropic quantities so (4.19) is true. For (4.20) we use the monotonicity 
of conditional multiparty information, Lemma 4.3. In (4.21) we use the chain- 
type rule from Lemma 4.4. In (4.22) we use the index information k contained 
in X[. Finally, since E S(l is the infimum over all extensions, it must be no 
more than the particular extension E, so (4.23) must be true. Now since the 
extension E in (4.18) was arbitrary, it follows that E sq (Xi; X 2 ; . . . ; X m ) p > 
J2 k PkE sq (Xi, X 2 ; . . . ; X m )~ k which completes the proof of Property 1. 

To show the convexity of E SCL , we again follow the same route as in [17]. 
Consider the states p x ^- x ^ an d a XlX2 - Xm and their extensions p x ^- x ^ E 
and g- x i x 2-X m E d e fi ne d over the same system E. We can also define the 
weighted sum of the two states T x ^- X ^ = x p x 1 x 2 ...x m + (i _ \} a XiX 2 ...x m 
and the following valid extension: 

fXl x a ...x m EE> = Xp x lX2 ...x m E | OX o|^' + (1 _ A)(T x 1 x 2 ...x m s (4.24) 
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Using the definition of squashed entanglement we know that 

E sq (Xi, X 2 ; ■ . .; X m ) T 

^ -^I{Xi'-,X 2 \ . . .\X m \EE') f 

= X - [A/(X i; X 2 ; . . . ; X m \E)~ p + (1 - X)I(X 1 ; X 2 ; . . . ; X m \E) a ] . 
Since the extension system E is completely arbitrary we have 

EsqiXi, . . . ; X m ) T < \E sq (Xi; . . . ; X m ) p + (1 — A)£ l sq (X 1 ; . . . ; X m ) a , 
so Esq is convex. 

We have shown that E sq satisfies both Properties 1 and 2 from page 51. 
Therefore, it must be an entanglement monotone. □ 

Subadditivity on Product States Another desirable property for mea- 
sures of entanglement is that they should be additive or at least subadditive 
on tensor products of the same state. Subadditivity of E scl is easily shown 
from the properties of multiparty information. 
Proposition 4.7. E sq is subadditive on tensor product states, i.e. 

E sq {XiYi\ . . . ; X m Y m ) p < E sq (Xi\ . . .; X m ) p + E sq (Yi, . . . ; Y m ) p (4.25) 
where pXiYiX2Y2...x m Y m _ pX\X2---X m ^ pYiY2...Y m 

Proof. Assume that p x i x 2-XmE an j pY 1 Y 2 ...Y m E' are ex t e nsions. Together they 
form an extension p^^x^-XmYmEE' f Qr ^ e product state. 

2-E'sq(A A iFi; X 2 Y 2 ; . . . ; X m Y m ) p 

< I(X 1 Y 1 ;X 2 Y 2 ;...;X m Y m \EE') 

= H(XiYiEE') - H(X X Y X X 2 Y 2 . . . X m Y m EE') - (m - l)H(EE') 
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= J(X i; X 2 ; . . . ; X m \E) + J(Y i; Y 2 ; . . . ; F m |£"). (4.26) 

The first line holds because the extension for the XY system that can be built 
by combining the X and Y extensions is not the most general extension. The 
proposition then follows because the inequality holds for all extensions of p 
and a. □ 

The question of whether E sq is additive, meaning superadditive in addi- 
tion to subadditive, remains an open problem. Indeed, if it were possible to 
show that correlation between the X and Y extensions is unnecessary in the 
evaluation of the squashed entanglement of p <8> a, then E sq would be additive. 
This is provably true in the bipartite case [17] but the same method does not 
seem to work with three or more parties. 

Continuity The continuity of bipartite E sq was conjectured in [17] and 
proved by Alicki and Fannes in [67]. We will follow the same argument here 
to prove the continuity of the multiparty squashed entanglement. The key to 
the continuity proof is the following lemma which makes use of an ingenious 
geometric construction. 

Lemma 4.8 (Continuity of conditional entropy [67]). Given density matrices 
p AB and <j ab on the space Ti A <S> 7~L B such that 

||p -(t||i = ^^-(71 <e, (4.27) 

it is true that 

\H(A\B) P - H(A\B) a \ < 4e\ogd A + 2h(e) (4.28) 

where d A = dimH A and h(e) = — eloge — (1 — e)log(l — e) is the binary 
entropy. 
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This seemingly innocuous technical lemma makes it possible to prove the 
continuity of E sq in spite of the unbounded dimension of the extension system. 
Proposition 4.9 (E sq is continuous). For all states p x ^-X m ? a x^x 2 ...x m 
with \\p — a\\i < e, \\E sq (p) — E sg (a)\\ < e' where e' depends on e and vanishes 
as e — > 0. 

The precise form of e' can be found in equation (4.35). 

Proof. Proximity in trace distance implies proximity in fidelity distance [40], 
in the sense that 

F ( p x 1 x a ...x m ^ (T x 1 x a ...x m j > i _ e> (4.29) 
but by Uhlmann's theorem [68] this means that we can find purifications 

^X lX2 ...X m R ^X lX2 ...X m R guch that 

F ^x lX2 ...x mR ^ ^ Xl x 2 ...x mR ^ > : _ e ( 4>30 ) 

Now if we imagine some general operation A that acts only on the purifying 
system R 

pXl X 2 ...X m E = ^X,X 2 ...X m ^ ^E^ p \X 1 X 2 ...X m R (43^ 
a X lX2 ...X m E = (j Xl X a ...X m ^ A R-*E^ a \ Xl X 2 ...X m R (433) 

we have from the monotonicity of fidelity for quantum channels that 

F ( p X 1 X a ...X m E^ a X 1 X 2 ...X m E} > F ^X 1 X 2 ...X m R ^^X^X^ > X _ £> (4 33) 

which in turn implies [40] that 

^ p x 1 x 2 ...x m E _ a x 1 x 2 ...x m E^ < 2 ^ ^4^34) 
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Now we can apply Lemma 4.8 to each term in the multiparty information to 
obtain 



I(Xi, X 2 ; . . . X m \E) p — I(Xi, X 2 ; . . . X m \E) c 



< 



^HiX^r-HiX^EX 



i=i 



+ 



H(XiX 2 . . . X m \E) p — H(XiX 2 . . . X m \E) a 

m / m \ 

< i 8 ^ lo S d i + ZhpVe)] + 8<y/i log I JJ ] + 2h(2y/e) 
i=i \i=i ) 

= 16v^log ( f[d t ) + (m+ l)2h(2y/e) =: e' (4.35) 



where di = dimH Xi and h(.) is as defined in Lemma 4.8. Since we have shown 
the above inequalities for any extension E and the quantity e' vanishes as 



e — > 0, we have proved that E sq is continuous. 



□ 



4.3 Example calculations of E scl 

Below we give several examples of simple systems where E sq is calculated 
to gain intuition about how it behaves. As a first step we verify that E sq is 
zero for states that are manifestly not entangled. 

Example 1: Fully decoupled state Given the state p ^ x '- x ^ = p x i ® 
p x ' 2 <S> ■ • ■ <E> p Xm = <S>T P Xi ^e mutual information for this state is: 

m 

I(X 1 ; X 2 - . . . ■ X m ) = V H(Xi) - H(X U X 2 , ...,X m ) = (4.36) 

i v • ' 

=Y™H{X l ) 

which is to be expected since the state is a tensor product and cannot contain 
entanglement. 

In the next example we look at more complicated states where E sq is 
non-zero but simple to calculate. 
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Example 2: Partially separable state Now consider a state which is 
separable on all systems except for two. We write 



I Xn 



p AB Xl x 2 ...x m = Y^ Pj \a^\ AB ® \WA Xl ® • • • 10X0 
j 

and an extension E that records the index j. For this extension we will have: 
I(A;B;X 1 ;...;X m \E) = 

m 

= H(A\E) + H{B\E) + H{Xi\E) - H{ABX l ■ ■ ■ X m \E) 

i 

m 

= H(AE)+H(BE)+J2H(X i E)-H(ABX 1 - ■ ■ X m E)-(m+l)H{E) 

> (1) H(AE) + H(BE) - H(ABE) - H(E) 
= I{A-B\E) 
> 2E sq (A;B) p 

To show (1) we repeatedly used the strong subadditivity property of von Neu- 
mann entropy 

-H(E)-H(EY 1 ---Y m ) > -H(EY m )-H(EY 1 ---Y m _ 1 ). (4.37) 

Thus we have shown that for partially separable states, E sq of the whole is at 
least as much as its non-separable part. 

Example 3: E scl for the GHZ and W states Consider the m-party GHZ 
state \GHZ) XlX2 '~ Xm = |o) 8 "^r) 0m and the m-party W state \W) XlX ' 2 - Xm = 
7k ^7=v N)> where \i) = |0 • • - 01^_^0). In particular, the three-party GHZ 

i 

and W states correspond to 

\GHZ) = |0 ° 0) ^ |m) and |W> = -L (|001> + |010) + |100» . 
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The squashed entanglement of the the general GHZ state is 



E sq (Xi, X 2 ; ■ ■ ■ ; X m ) GHZ — -inf I(Xi,X 2 ; • • • ;X m \E) 

Z E 



(pure state) 

m 

- H(Xi . . . X m ) 



max. mixed 



pure 



1 

2 

m 
2~ 



For the W state, the 1-qubit reduced systems are of the form 
Tr X2 ...x m (\W)(W\) 



V 



(4.38) 



/ 



± 

m 

and so the squashed entanglement for the W state is given by the formula 



E sq (Xi, X 2 ; • • • ; X m ) 



w 



1 

2 
1 

2 

rn 



I(Xi, X 2 ; • • • ; X m ) 



Y j H{X i )-H{X 1 ...X m ) 







m 



9 l0g2 I , 
v (m — 1) 



= o lQ g2 



m 



(m - 1 )("»-!) 



-log 2 m + 0(l) «-. 



We can see that the GHZ state is maximally multiparty entangled whereas 
the W state contains very little multiparty entanglement. 



CHAPTER 5 
Multiparty distributed compression 

Distributed compression of classical information, as discussed in Sec- 
tion 2.1.6, involves many parties collaboratively encoding their classical sources 
Xi,X 2 ■ ■ ■ X m and sending the information to a common receiver [34]. In the 
quantum setting, the parties are given a quantum state <^A A 2-A m e ^A 1 A 2 -A m 
and are asked to individually compress their shares of the state and transfer 
them to the receiver while sending as few qubits as possible [14]. We have 
already discussed a version of quantum distributed compression in Section 3.3 
where we used shared entanglement and classical communication to accom- 
plish the task [43]. In this chapter, we consider the fully quantum scenario 
where only quantum communication is used and classical communication is 
forbidden. 

In our analysis, we work in the case where we have many copies of the 
input state, so that the goal is to send shares of the purification \^ AlA2 " AmR — 
(|(/9) AlA2 '" Ami? )®™, where the A^s denote the m different systems and R denotes 
the reference system, which does not participate in the protocol. A word on 
notation is in order. We use Ai to denote both the individual system associated 
with state (p as well the n-copy version Af n associated with ip; the intended 
meaning should be clear from the context. We also use the shorthand notation 
A = AiA 2 ■ ■ -A m to denote all the senders. 

The objective of distributed compression is for the participants to transfer 
their i?-entanglement to a third party Charlie as illustrated in Figure 5-1. As 
discussed in Section 2.2.5, preserving the i?-entanglement means our protocol 
has high entanglement fidelity [41] which guarantees that we can transfer the 
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state (p A i A 2- A ™ ^ a i so preserve all the correlations this state has with the 
rest of the world. 

Charlie Charlie Charlie 




R R 



Figure 5-1: Pictorial representation of the quantum correlations between the 
systems at three stages of the protocol. Originally the state is shared be- 
tween A\A 2 • • ■ A m and R. The middle picture shows the protocol in progress. 
Finally, all systems are received by Charlie and is now shared between 
Charlie's systems AiA 2 ■ ■ ■ A m and R. 

An equivalent way of thinking about quantum distributed compression is 
to say that the participants are attempting to decouple their systems from the 
reference R solely by sending quantum information to Charlie. Indeed, if we 
assume that originally R is the purification of A\A 2 • • • A m , and at the end of 
the protocol there are no correlations between the remnant W systems (see 
Figure 5-1) and R, then the purification of R must have been transferred to 
Charlie's laboratory since none of the original information was discarded. 

To perform the distributed compression task, each of the senders indepen- 
dently encodes her share before sending part of it to Charlie. The encoding 
operations are modeled by quantum operations (CPTP maps) £j with outputs 
Ci of dimension 2 nQi . Once Charlie receives the systems that were sent to him, 
he will apply a decoding operation V, with output system A = A\A 2 . . . A m 
isomorphic to the original A — A X A 2 . . . A m . 

— # 

Definition 5.1 (The rate region). We say that a rate tuple Q = (Qi, Q 2 , . . . , Q m ) 
is achievable if for all e > there exists N(e) such that for all n > N(e) 
there exist n- dependent maps (£i,£ 2 , . . . ,£ m ,V) with domains and ranges as 
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in the previous paragraph for which the fidelity between the original state, 
isfies 

We call the closure of the set of achievable rate tuples the rate region. 

5.1 The multiparty FQSW protocol 

Like the original FQSW protocol, the multiparty version relies on Schu- 
macher compression and the mixing effect of random unitary operations for 
the encoding. The only additional ingredient is an agreed upon permutation 
of the participants. The temporal order in which the participants will perform 
their encoding is of no importance. However, the permutation determines how 
much information each participant is to send to Charlie. 

For each permutation n of the participants, the protocol consists of the 
following steps: 

1. Each Alice-i performs Schumacher compression on her system Ai reduc- 
ing its effective size to the entropy bound of roughly if (A) qubits per 
copy of the state. 

2. Each participant applies a known, pre-selected random unitary to the 
compressed system. 

3. Participant % sends to Charlie a system Cj of dimension 2 nQl where 

Q i >^I(A i ;A K .R) v (5.1) 

where /Q = {"n(j) : j > n' 1 ^)} is the set of participants who come after 
participant i according to the permutation. 
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4. Charlie applies a decoding operation D consisting of the composition of 
the decoding maps P^ m ) o • • • o T>^ 2 ) ° ^41) defined by the individual 
FQSW steps in order to recover a A i A 2--- A ™ nearly identical to the original 

^A 1 A 2 -A m anc j p Ur ifyj n g R 

Note that, in order to perform the decoding operation V, Charlie needs 
to know which random unitaries which were used in the individual encoding 
operations £j. We assume this information is shared before the beginning of 
the protocol in addition to the permutation ir. 
5.1.1 Statement of results 

This section contains our two main theorems about multiparty distributed 
compression. In Theorem 5.2 we give the formula for the set of achievable 
rates using the multiparty FQSW protocol (sufficient conditions). Then, in 
Theorem 5.3 we specify another set of inequalities for the rates Qi which must 
be true for any distributed compression protocol (necessary conditions). In 
what follows, we consistently use K C {1,2,... m} to denote any subset of the 
senders in the protocol. 

Theorem 5.2. Let |^}^"^ be a pure state. If the inequality 



Y^mAkU + H(R) V - H(RA K \ 



(5.2) 

holds for all K C {1,2,..., m}, then the rate tuple (Q±, Q2, • • • , Q m ) is achiev- 
able for distributed compression of the Ai systems. 

Because Theorem 5.2 expresses a set of sufficient conditions for the pro- 
tocol to succeed, we say that these rates are contained in the rate region. The 
proof is given in the next section. 

In the m-dimensional space of rate tuples Q2, • • • , Qm) £ the 
inequalities (5.2) define a convex polyhedron [69] whose facets are given by 
the corresponding hyperplanes, as illustrated in Figure 5-2. More specifically, 
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the rate region is a supermodular polyhedron [70], which means that it has 
some special properties that will help us in the proof of Theorem 5.2. 




Figure 5-2: The rate region for the multiparty FQSW protocol with three 
senders. 



In order to characterize the rate region further we formulate Theorem 5.3, 
an outer bound on the rates that must be satisfied for all distributed com- 
pression protocols. 



Theorem 5.3. Let \cp) 



A 1 A 2 -A m R 



be a pure state input to a distributed com- 



pression protocol which achieves the rate tuple (Qi, Q2, ■ ■ ■ , Qm), then it must 
be true that 



x ^ ^ 1 



fce/c 



Y}H{A k )^\ + H(R) V - H(RAjc) 



.fce/c 



(5.3) 

for all K C {1,2,..., m}, where E sq is the multiparty squashed entanglement. 



The multiparty squashed entanglement was defined in Section 4.2 above. 
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Notice that Theorems 5.2 and 5.3 both provide bounds of the same form 
and only differ by the presence of the E sq term. The rate region is squeezed 
somewhere between these two bounds as illustrated in Figure 5-3. 

Q/3 A 



Figure 5-3: A two dimensional diagram showing the inner bound from The- 
orem 5.2 and the outer bound from Theorem 5.3. The boundary of the real 
rate region must lie somewhere in between. 

For states which have zero squashed entanglement, the inner and outer 
bounds on the region coincide so that in those cases our protocol is an optimal 
solution to the multiparty distributed compression problem. 

5.2 Proof of inner bound 

The multiparty fully quantum Slepian-Wolf protocol can be constructed 
directly [71] or through the repeated application of the two-party FQSW pro- 
tocol [5] . We choose the latter approach here in order to illustrate the power of 
the FQSW protocol as a building block for more complex protocols. To com- 
plete the proof we will have to "stitch together" different achievable points 
using some concepts from the theory of polyhedra [69]. The multiparty rate 
region has a complex but regular geometry so it is important that we use the 
right language to describe it. The geometry of multiparty rate regions has 
previously been discussed in [70, 72]. 




The rate region boundary 



Theorem 5.3 



Q 



a 
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For every permutation n G S m of the m senders, there is a different rate 
tuple q w = (Qi, Q 2 , ■ ■ ■ , Qm) n £ K m which is achievable in the limit of many 
copies of the state. By time-sharing we can achieve any rate that lies in the 
convex hull of these points. We will show that the rate region for an input 
state \ip) Al "' AmR can equivalently be described by the set of inequalities from 
Theorem 5.2, that is 

where /C C {1,2,..., m) ranges over all subsets of participants and C/e is the 
name we give to the constant on the right hand side of the inequality. The 
proof of Theorem 5.2 proceeds in two steps. First we show the set of rate 
tuples {q n } is contained in the rate region and then we prove that the set of 
inequalities (5.4) is an equivalent description of the rates obtained by time 
sharing and resource wasting of the rates {q n }. 

Consider the m-dimensional space of rate tuples (Qi, • • • , Q m ) G M. m . We 
begin by a formal definition of a corner point q n . 

Definition 5.4 (Corner point). Let n G S m be a permutations of the senders 
in the protocol. The corresponding rate tuple q n = (Qi, Q 2 , • • • , Q m ) is a corner 
point if 

Q-rtk) = ^(Aw; A^t+i) ■ ■ ■ A^ m )R) (5.5) 

where the set A-^k+i) • • • denotes all the systems which come after k in 

the permutation ir. 

We define Q := {q n : n G S m }, the set of all corner points. Clearly, 
\Q\ < ml but since some permutations might lead to the same rate tuple, the 
inequality may be strict. 

Lemma 5.5. The set of corner points, Q = {q n : n G S m }, is contained in 
the rate region. 



J2 H ( A k)v + H(R) ip - H(RA K ) V 



--■■C K 



(5.4) 



LketC 
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Proof sketch for Lemma 5. 5. We will now exhibit a protocol that achieves one 
such point. In order to simplify the notation, but without loss of generality, 
we choose the reversed-order permutation n = (m, ... ,2, 1). This choice of 
permutation corresponds to Alice- m sending her information first and Alice- 1 
sending last. 

We will repeatedly use the FQSW protocol is order to send the m systems 
to Charlie: 

1. The first party Schumacher compresses her system A m and sends it to 
Charlie. She succeeds provided 

Q m > \l{A m - A X A 2 . . . An-xR) + 5 = H(A m ) + 5 

for any 5 > 0. The above rate is dictated by the FQSW inequality (3.25) 
because we are facing the same type of problem except that the "refer- 
ence" consists of R as well as the remaining participants A\A 2 ■ ■ ■ A m -\. 
The fact that the formula reduces to Q m > H(A m ) should also be ex- 
pected since there are no correlations that the first participant can take 
advantage of; she is just performing Schumacher compression. 

2. The second party also faces an instance of an FQSW problem. The task 
is to transmit the system A m _i to Charlie, who is now assumed to hold 
A m . The purifying system consists of A X A 2 ■ ■ ■ A m _ 2 R. According to 
inequality (3.25) the rate must be 

Qm-l > A X A 2 ■ ■ ■ A m _ 2 R) + 5 

for any 5 > 0. 
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3. The last person to be merging with Charlie will have a purifying system 
consisting of only R. Her transfer will be successful if 

Qi > \l{A 1 -R) + 5 

for any 5 > 0. 

On the receiving end of the protocol, Charlie will apply the decoding map 
V consisting of the composition of the decoding maps V 1 oV 2 o- ■ -oD m defined 
by the individual FQSW steps to recover the state a AlAi " ' Am , which will be 
such that the fidelity between \ip) and a is high, essentially by the 
triangle inequality. Finally, because we can make 5 arbitrarily small, the rate 
tuple (Qi, ■ ■ ■ ,Q m ), with 

Q k = ^I(A k ;A 1 ---A h . 1 R), (5.6) 

must be contained in the rate region. The same argument applies for each 
permutation n G S m , leading to the conclusion that the full set Q is contained 
in the rate region. □ 

Each one of the corner points q n can also be described by an equivalent 
set of equations involving sums of the rates. 

Lemma 5.6. The rate tuple (Qi, Q 2 , ■ ■ ■ , Q m ) is a corner point if and only if 
for some n G S m and for all I such that 1 < / < m, 



= \ 



m—l+l<k<m 



^ H{A^ k )) + H(R) - H(A^ m ^i +lym ]R) 

m—l+l<k<m 



(5.7) 

where A^ m _ l+ljin } := A^ m _ m) A^ m _ l+2 ) ■ ■ ■ A^ m ) denotes the last I partici- 
pants according to the permutation n. 

Proof of Lemma 5.6. The proof follows trivially from Lemma 5.5 by consider- 
ing sums of the rates. If we again choose the permutation n = (m, . . . , 2, 1) 
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for simplicity, we see that the sum of the rates of the last I participants is 
1 



Qi + ■ ■ ■ + Qi = 2 
_ l 

~~ 2 



I(A i; R) + I(A 2 ; A l R) + ■■■ + I(A f , A, ■ ■ ■ A^R) 



H{A k ) + H{R) - H(Ai ■ ■ ■ A t R) 



= C 12 ..j. (5.8) 



■ l<k<l 

A telescoping effect occurs and most of the inner terms cancel so we are left 
with a system of equations identical to (5.7). Moreover, this system is clearly 
solvable for the individual rates Qk- The analogous simplification occurs for 
all other permutations. □ 

So far, we have shown that the set of corner points Q is contained in 
the rate region of the multiparty fully quantum Slepian-Wolf protocol. The 
convex hull of a set of points Q is defined to be 



conv 



(Q) := {?Gl m : x = Aift, qi G Q, A, > 0, ^ A, = l} . (5.9) 



Because of the possibility of time-sharing between the different corner points, 
the entire convex hull conv(Q) must be achievable. Furthermore, by simply 
allowing any one of the senders to waste resources, we know that if a rate tuple 
q is achievable, then so is q + w for any vector w with nonnegative coefficients. 
More formally, we say that any q + cone(ei, e 2 , . . . , e m ) is also inside the rate 
region, where {ej} is the standard basis for M m : e*j = (0, 0, . . . , 0, 1, 0, 0) and 



cone 



;(ei, -,e m ):={fGR m : x = ^ \e u \ > o} . (5.10) 
Thus, we have demonstrated that the set of rates 

P v := conv(Q) + cone{e x , ■ ■ ■ , e m ) (5.11) 
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is achievable. To complete the proof of Theorem 5.2, we will need to show 
that Py has an equivalent description as 

P n :=\(Qu--- ,Q m )eR m : J2^>C K ,y)CC{l,2,...,m}\, (5.12) 

where the constants C/c are as defined in equation (5.4). This equivalence is 
an explicit special case of the Minkowski- Weyl Theorem on convex polyhedra. 

Theorem 5.7 (Minkowski- Weyl Theorem). [69, p. 30] For a subset P C R m , 
the following two statements are equivalent: 

• P is a V -polyhedron: the sum of a convex hull of a finite set of points 
V = {pi} plus a conical combination of vectors W = {vJi} 

P = conv(P) + cone(W) (5.13) 

where conv(V) andconeiW) are defined in (5.9) and (5.10) respectively. 

• P is a H-polyhedron: an intersection of n closed half spaces 

P = {x E R m : Ax > a} (5.14) 

for some matrix A G M nxm and some vector a G M n . Each of the n rows 
in equation (5.14) defines one half space. 

Preliminaries Before we begin the equivalence proof in earnest, we make 
two useful observations which will be instrumental to our subsequent argu- 
ment. First, we prove a very important property of the constants C/c which 
will dictate the geometry of the rate region. 

Lemma 5.8 (Superadditivity). Let JC, C C {1,2, ... ,m} be any two subsets 
of the senders. Then 

C/cuc + Cicnc > Cic + Cc- (5.15) 
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Proof of Lemma 5. 8. We expand the C terms and cancel the ^-factors to ob- 
tain 

£ H(A k ) + H(R) - H(RA KUC ) £tf + H(R) - H(RA K ) 

keKuc > kefc 

+ H ( A k) + H(R) - H(RA KnC ) ' + £#(A fe ) + if (fl) - ff(iM £ ). 

After canceling all common terms we find that the above inequality is equiv- 
alent to 

H{RA K ) + H(RA C ) > H(RA KUC ) + H(RA KnC ), (5.16) 

which is true by the strong subadditivity (SSA) inequality of quantum en- 
tropy [36]. □ 

As a consequence of this lemma, we can derive an equivalence property 
for the saturated inequalities. 

Corollary 5.9. Suppose that the following two equations hold for a given point 
ofPn- 

J2Qk = C K and £Q fc = C £ . (5.17) 

kaK. kec 

Then the following equations must also be true: 

£ Qk = Ctcuc and £ Q k = C Kr]C - (5.18) 
k£KU£ keKnc 

Proof of Corollary 5. 9. The proof follows from the equation 

fcefc kec keKuc keKnc 

where the first inequality comes from Lemma 5.8. The second inequality is 
true by the definition of since /C U C and /C H C are subsets of {1, 2, ... , m}. 
Because the leftmost terms and rightmost terms are identical, we must have 
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equality throughout equation (5.19), which in turn implies the the union and 
the intersection equations are saturated. □ 

An important consequence of Lemma 5.8 is that it implies that the poly- 
hedron Pji has a very special structure. It is known as a supermodular poly- 
hedron or contra-polymatroid. The fact that conv(Q) = P^ was proved by 
Edmonds [70] , whose ingenious proof makes use of linear programming duality. 
Below we give an elementary proof that does not use duality. 

A vertex is a zero-dimensional face of a polyhedron. A point Q = 
(Qi, Q2, ■ ■ ■ , Qm) £ -Ph C W rn is a vertex of P n if and only if it is the unique 
solution of a set of linearly independent equations 

S £ J Qk = C c ^ l<i<m (5.20) 

keCi 

for some subsets Ci C {1,2, ... ,m}. In the remainder of the proof we re- 
quire only a specific consequence of linear independence, which we state in the 
following lemma. 

Lemma 5.10 (No co-occurrence). Let Ci C {1,2,..., m} be a collection of m 
sets such that the system (5.20) has a unique solution. Then there is no pair 
of elements j , k such that j e Ci if and only if k G Ci for all i. 

Proof. If there was such a pair j and k, then the corresponding columns of 
the left hand side of (5.20) would be linearly dependent. □ 

Armed with the above tools, we will now show that there is a one-to- 
one correspondence between the corner points Q and the vertices of the 7i- 
polyhedron P^. We will then show that the vectors that generate the cone 
part of the 7i-polyhedron correspond to the resource wasting vectors {e*j}. 
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Step 1: Q C vertices(P-n) We know from Lemma 5.6 that every point 
q n G Q satisfies the m equations 



^ ^ Q-n(k) C-nfm.— i+l,m] j 
m— i+l<k<m 



1 <i < m. 



(5.21) 



The equations (5.21) are linearly independent since the left hand side is 
triangular, and have the form of the inequalites in (5.12) that are used to 
define Pn- They have the unique solution: 

Q-jim) — Cir(m) Q*(i) = C-n[i,m\ ~ C*[i+l,m] , 1 < i < m — 1. (5.22) 

We need to show that this solution satisfies all the inequalities used to define 
Pu in (5.12). We proceed by induction on \K.\. The case \JC\ = 1 follows 
from (5.22) and the superadditivity property (5.15). For |/C| > 2 we can write 
K. = {7i(i)} U K! for some K! C {vr(i + 1), %(i + 2), . . . , vr(m)}. Then 



keK. 



fcefc' 



(induction) 



where we again used superadditivity to get the last inequality. 

Step 2: vertices(Pn) ^ Q In order to prove the opposite inclusion, we will 
show that every vertex of P-u is of the form of Lemma 5.6. More specifically, 
we want to prove the following proposition. 

Proposition 5.11 (Existence of a maximal chain). Every vertex of P n , that 
is, the intersection of m linearly independent hyperplanes 

^Q fc = C A , l<t<m, (5.23) 

keCi 
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defined by the family of sets {Ci, 1 < i < m} can be described by an equivalent 
set of equations 

J2Qk = C ^ l<t<m, (5.24) 

kGlCi 

for some family of sets distinct /Q C {1,2,..., m} that form a maximal chain 
in the sense of 

= /C C /Ci C /C 2 C • • • C /C m _! C K m = {1, 2, . . . , to}. (5.25) 

Since there exists a permutation 7r such that Wi, 7r[m — i + 1, to] = /Q this 
implies that all the vertices of Pn are in Q. The main tool we have have at 
our disposal in order to prove this proposition is Corollary 5.9, which we will 
use extensively. 

Proof of Proposition 5.11. Let x be the subsets of {1, 2, ... , to} for which 

the inequalities are saturated and define Cf := Ci fl S, the intersection of Ci 
with some set S C {1,2,..., to}. 
Construct the directed graph G = (V,E), where: 

• V = {1, 2, ... , m}, i.e. the vertices are the numbers from 1 to to; 

• E = {(j, k) : (Vi) j G £j =^ G £j }, i.e. there is an edge from ver- 
tex j to vertex k if whenever vertex j occurs in the given subsets, then 
so does vertex k. 

Now G has to be acyclic by Lemma 5.10, so it has a topological sorted order. 
Let us call this order v. Let /C = and let 

/Q = {v m -i + i, . . . ,v m } (5.26) 

for / G {1, . . . ,to}. The sets /Q, which consist of the last / vertices according 
to the ordering u, form a maximal chain /Co C fCi C • • • C /C m _i C /C m by 
construction. 
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We claim that all the sets /Q can be constructed from the sets by 
using unions and intersections as dictated by Corollary 5.9. The statement 
is true for /C m = {1,2, .. . , m} because every variable must appear in some 
constraint equation, giving K, m = Uj£j. The statement is also true for /C m _i = 
{z/ 2 , u rn } since the vertex v\ has no in-edges in G by the definition of a 
topological sort, which means that 

/C m _!= |J Cf m . (5.27) 

For the induction statement, let I G {m — 1, . . . , 2, 1} and assume that /Q = 
[j i Cf l . Since the vertex v m ^\ has no in-edges in the induced subgraph gener- 
ated by the vertices /Q by the definition of the topological sort, /Q_i can be 
obtained from the union of all the sets not containing u m -i' 

£,_!= |J Cf'. (5.28) 

In more detail, we claim that for all uj ^ v m -\ G K-i-i there exists i such that 
v m -i <£. Cf 1 and uj G Cf 1 . If it were not true, that would imply the existence 
of uj 7^ v m -i £ A^z i such that for all i, v m _i G Cf 1 or w ^ This last 

condition implies that whenever uj G Cf 1 it is also true that Vm-i £ ^f^ ! , which 
corresponds to an edge {uj, f n -i) in the induced subgraph. □ 

We have shown that every vertex can be written in precisely the same form 
as Lemma 5.6 and is therefore a point in Q. This proves vertices(Pfi) C Q, 
which together with the result of Step 1, implies vertices(Pn) = Q- 

Step 3: Cone Part The final step is to find the set of direction vectors 
that correspond to the cone part of P n . The generating vectors of the cone are 
all vectors that satisfy the homogeneous versions of the halfspace inequalities 
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(5.14), which in our case gives 



(5.29) 



ketc 



for all /C C {1,2, ... ,m}. These inequalities are satisfied if and only if 
Qk > for all k. We can therefore conclude that the cone part of P^ is 
cone(ei, e 2 , ...,e m ). 



This completes our demonstration that Py is the V-polyhedron description 
of the 7i-polyhedron P n . Thus we arrive at the statement we were trying to 
prove; if the inequalities 



52Qk>C K = l 



keK. 



keK. 



(5.30) 



are satisfied for any /C C {1,2,..., m}, then the rate tuple (Qi, Q2, ■ ■ ■ ,Q r 
is inside the rate region. This completes the proof of Theorem 5.2. 



5.3 Proof of outer bound 

We want to show that any distributed compression protocol which works 
must satisfy all of the inequalities (5.3) from Theorem 5.3. In order to prove 
this, we will use some of the properties of multiparty information and squashed 
entanglement. We break up the proof into three steps. 

Step 1: Decoupling Formula We know that the input system is 
a pure state. If we account for the Stinespring dilations of each encoding and 
decoding operation, then we can view any protocol as implemented by unitary 
transformations with ancilla and waste. Therefore, the output state (including 
the waste systems) should also be pure. More specifically, the encoding 
operations are modeled by CPTP maps Si with outputs Cj of dimension 2 n< ^ i . 
In our analysis we will keep track of the purification (waste) systems Wi of the 
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the Stinespring dilations Si, so the evolution as a whole will be unitary. 



10} 



Si 



Ci 
Wi 



to Charlie 

waste 



Once Charlie receives the systems that were sent to him, he will apply a 
decoding CPTP map V with output system A = A\A 2 . . . A m isomorphic to 
the original A — A\A 2 . . . A m . 



|o) - 



V 



A x --- A Tl 

W c 



near-purification of R 
Charlie's waste 



In what follows we will use Figure 5-4 extensively in order to keep track 
of the evolution and purity of the states at various points in the protocol. 

i i i 




Figure 5-4: A general distributed compression circuit diagram showing the 
encoding operations Si with output systems Cj (compressed data) and Wi 
(waste). The decoding operation takes all the compressed data <S>fii an d 
applies the decoding operation D to output a state a A " R " which has high 
fidelity with the original \ip) A R ■ 
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The starting point of our argument is the fidelity condition (Definition 5.1) 
for successful distributed compression, which we restate below for convenience 

F(\^) AnR \ a AnRn )>l-e (5.31) 

where \vp) A R = (\(p) AlA2 "' AmR ^ is the input state to the protocol and 
a A n R n j g ou tp U t state of the protocol. Since a AnRn has high fidelity with 
a rank one state, it must have one large eigenvalue 

A ma x(a >i? ")>l-6. (5.32) 

Therefore, the full output state \a) AR Wv " WmW c h as Schmidt decomposition 
of the form 



k) w - fffl = E V 7 ^ \ei) AnRn ® \f^- w - Wc , (5.33) 

i 

where |e») , are orthonormal bases and Ai = A max > 1 — e. 

Next we show that the output state \a) A R Wv " WmW c j s ver y c i ose [ n 
fidelity to a totally decoupled state a A " Rn ® a w 1 -w m w c ^ w hich is a tensor 
product of the marginals of \a) on the subsystems A n R n and W\ ■ ■ ■ W m Wc- 

= Tr ^a^^-Wc 0(7 ^-W c y 
= (l_ e )3>i_ 3e . (5.34) 

i 

Using the relationship between fidelity and trace distance [40], we can trans- 
form (5.34) into the trace distance bound 

\ a){a ^R"Wr-w m w c _ a A^ a w v -w m w c ^ < 2v ^ (5 _3 5) 

By the contractivity of trace distance, the same equation must be true for 
any subset of the systems. This bound combined with the Fannes inequality 
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implies that the entropies taken with respect to the output state are nearly 
additive: 



\H{R n W K ) c ■ H(R n ) a + H(Wk)c\ < 2y/Ze\og{d B nd WK )+ri{2y/Ze) 

< 2v / 3elog(d A n^«) + 77(2v / 3e) 

< n log(4) + r?(2 V3e) 



for any subset /C C {l,2...m} with e < and 77(2;) = — xlogx. In the 
second line we have used the fact that d A = dn and exploited the fact that dw K 
can be taken less than or equal to d A 2? , the maximum size of an environment 
required for a quantum operation with inputs and outputs of dimension no 
larger than d A ™- 

Step 2: Dimension Counting The entropy of any system is bounded 
above by the logarithm of its dimension. In the case of the systems that 
participants send to Charlie, this implies that 



We can use this fact and the diagram of Figure 5-4 to bound the rates Qi. 
First we add H(A£)^ = H(A£)^ to both sides of equation (5.37) and obtain 
the inequality 



]T Q fe > H(C K )v + if (4eV > H{C K A^, (5.38) 



For each encoding operation, the input system A,- L is unitarily related to the 
outputs CiWi so we can write 



= H(WiCi)v < H(Wi)v + H(Ci)y < H{Wi)^ + nQi, (5.39) 



(5.36) 




(5.37) 
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where in the last inequality we have used the dimension bound H(Ci) < nQi. 
If we collect all the Qi terms from equations (5.38) and (5.39), we obtain the 
inequalities 



nJ^Qt > H(C K Ajc) fl - H(Ajch (5.40) 
ietc 

i£fC i&K i&K, 

Now add equations (5.40) and (5.41) to get 

2n^2Qi> H ^ ~ H H{Wi)v + H(C K Ajc)^ - H(A£)^ 

i&K i&fC iS/C 



i&K i£tC 



— H(R n Afc)^, — fi(e,n) 

J^HiAi) + H(R n ) - H(R n A K ) 



(5.42) 

where the equality W comes about because the two systems \^) AkAr - r and 
^,\^c k w k a k r are p Ure ^ j ne q Ua ij^y (5_36) f rom Step 1 was used in ^ . 

Step 3: Squashed Entanglement We would like to have a bound on 
the extra terms in equation (5.42) that does not depend on the encoding 
and decoding maps. We can accomplish this if we bound the waste terms 
J2ieK H(Wi) a - H(W K ) a by the squashed entanglement 2E sq (A kl ; ■■■ ;A kl )^ 
of the input state for each /C = {ki, k^ . . . , ki} C {1, . . . , m} plus some small 
corrections. The proof requires a continuity statement analogous to (5.36), 
namely that 

\H(Wi)-H(W t \R)\ </ 2 (e,n) (5.43) 
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where f 2 is some function such that / 2 (e, n)/n — > as e — > 0. The proof is 
very similar to that of (5.36) so we omit it. 

Furthermore, if we allow an arbitrary transformation Af R ~' E to be applied 
to the R system, we will obtain some general extension but the analog of 
equation (5.43) will remain true by the contractivity of the trace distance 
under CPTP maps. We can therefore write: 

ietc 

<J2 H (Wi\E) - H{W K \E) + [|/C| + l]/ 2 (e,n) 
ietc 

= I(W kl ; W k2 ; . . . ; W k - E) - I(W kl ;E) - ]T I(W t ; E) + f 2 {e, n) 

ia{K\k-i} 

=W I(W kl E;W k2 ;...;W kl )- £ J(Wi; E) + £(e, n) 

< (2) I(A kl E;W k2 ;...;W kl ) - ^ J(Wi; E) + f 2 {e, n) 

ie{K\k-L} 

=W I(A kl ; W k2 ;...;W kl ,E)- I(A kl ;E) - £ I(W i; E) + /*(e, n) 

ie{K\ki} 

<( 3 ) I(A kl ; A k2 ; . . . ; A k - E) - '(A; E) + £(e, n) 

i<=K 

<I(A kl ;A k2 ;...;A kl \E) + f^(e,n), 

where we have used the shorthand f 2 (e,n) = [\fC\ + l]f 2 (e,n) for brevity. 
Equations marked W use Lemma 4.2 and inequality ^ comes about from 
Lemma 4.3, the monotonicity of the multiparty information. Inequality ^ is 
obtained when we repeat the steps for k 2 , . . . ,ki. The above result is true for 
any extension E but we want to find the tightest possible lower bound for the 
rate region so we take the infimum over all possible extensions E thus arriving 
at the definition of squashed entanglement. 
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Putting together equation (5.42) from Step 2 and the bound from Step 3 we 
have 



J2 H ( A i) + H ( Rn ) - H(R n A K ) 



- [Y.HiW^-HiW^ -/i(e,n) 



> 



^tf(A) + #(in-#(^4/e) 



- 2£ sq (A fel ; • • • ; A kl )j, - /i(e, n) - /^(e, n). 



We can simplify the expression further by using the fact that = \ip) to 
obtain 



.fee*: 



2n 



2n 



where the we used explicitly the additivity of the entropy for tensor prod- 
uct states and the subadditivity of squashed entanglement demonstrated in 
Proposition 4.7. 

Theorem 5.3 follows from the above since e > was arbitrary and the 
sum (/i(e, n) + f' 2 (e, n))/n — > as e — > 0. □ 
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5.4 Discussion 

The multiparty fully quantum Slepian-Wolf protocol is an optimal solu- 
tion to the distributed compression problem for separable states, i.e. states of 
the form 

i 

because = for such states. For general states, we have provided an 
outer bound on the set of achievable rates based on the multiparty squashed 
entanglement. In this section, we outline some other aspects of the multiparty 
FQSW protocol and its relation to other protocols. 

First, we note that there is an alternative, more compact way of writing 
the rate sum inequalities of Theorem 5.2 and Theorem 5.3. Consider the 
inequalities of the inner bound (5.2) reproduced below: 



2 



V/C C {!,..., m}. (5.44) 



^H(A k ) + H{R) - H(RA K ) 

.keK 

The term on the right hand side can be expressed as a multiparty information 
^Qk> \l(A- K] R), V/C C {1, ... , m}, (5.45) 

where I{A-^\ R) is the multiparty information of all the members of /C and 
R. The multiparty information function is naturally suited to the multiparty 
distributed compression problem. 

When only two parties are involved (m = 2), the inequalities in (5.44) 
reduce to the two-party bounds on distributed compression presented in [5]: 

Q 1 >h(A 1 ;R), 

Q2>\l{A 2] R), (5.46) 
Qi + Q 2 > \ [H(A 1 ) + H(A 2 ) + H{A 1 A 2 )} . 
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However, we now understand the mystery behind the expression that looks like 
the mutual information with a reversed sign: it is simply the form ^I(Ai, A 2 ; R), 
where H(R) = H(A 1 A 2 ) and H(A 1 A 2 R) = 0. The outer bound inequalities 
(5.3) similarly reduce to the corresponding expressions in the FQSW paper 
[5] with the multiparty squashed entanglement being replaced by the original 
two-party squashed entanglement of [17]. 

Another observation concerns the classical communication cost of the pro- 
tocol. If we move away from the "fully quantum" regime and allow classical 
communication between the senders and the receiver we can achieve better 
rates. We do this by recycling the entanglement generated by the FQSW pro- 
tocol. For two parties, the combination of multiparty FQSW of equation (5.46) 
with teleportation reproduces the state merging results of equation (3.21) 

R A > H(A\B) p , 

R B > H(B\A) p , (5.47) 

R a + Rb> H{AB) p . 

Finally we note that the multiparty FQSW protocol can be operated 
backwards in time to produce an optimal reverse Shannon theorem for the 
quantum broadcast channel [57]. 



CHAPTER 6 

Possible applications to the black hole information paradox 

There are very few physical systems that require both the application 
of the principles of general relativity and of quantum mechanics in order to 
understand them. Black holes fall into this category. Classically, a black hole 
is a region of space where the gravity is so strong that nothing can escape its 
pull - not even light. However, according to a certain semi-classical calculation 
performed by Hawking [73], black holes emit thermal radiation at a very slow 
rate. Thus, while it may take a very long time, all the mass/energy that fell 
into the black hole will eventually be released back into the universe and the 
black hole will evaporate. 

This scenario poses a serious problem known as the black hole information 
paradox. Consider a universe originally in the pure state | Universe) which 
collapses onto itself to form a black hole. After a very long time, the black hole 
evaporates completely to leave behind a universe filled with thermal radiation, 
which corresponds to the maximally mixed state. Herein lies the paradox: an 
initially pure state has evolved to a mixed state — something which violates 
the laws of unitary evolution so central to all of quantum theory. 

Does gravity lead to non-unitary evolution or is general relativity incom- 
plete? Over the last 30 years, many preeminent physicists have had some- 
thing to say about this question and yet this paradox still defies explanation 
[74, 75, 76]. What is worse is that the more we think about the information 
paradox the more we realize that it is not an "unwarranted extrapolation from 
an untrustworthy approximation" [74] but rather a true paradox of physics that 
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cannot be explained yet. True paradoxes of this kind are indicators that the 
scientific theories we use do not provide a complete description of reality. 

The black hole information paradox is yet to be explained in a satisfactory 
manner by modern physics and perhaps will not be until a theory of quantum 
gravity is developed. Recently, however, interesting contributions to the black 
hole information problem have been made by people from within the quantum 
information community [77, 78, 79, 80, 81]. In the last chapter of this thesis, we 
present a curious and counter-intuitive result about the nature of purifications 
and then use this observation to make a speculative comment about black 
holes with highly mixing internal dynamics. 

6.1 Polygamy of purification 

In a closing remark of the original FQSW paper [5], the authors make a 
very interesting observation about the nature of quantum purifications which 
we will refer to as polygamy of purification. Consider three parties — Alice, 
Bob and Ron who share the quantum state 



where |<&) denotes the maximally entangled state |$) = -^(|00) + |11)). In 
other words, Alice shares n entangled states with Ron and another n maxi- 
mally entangled states with Bob. The entanglement structure is illustrated in 
Figure 6-1 a). 

Now, we tell Alice to perform the standard FQSW task, that is, to transfer 
her R entanglement to Bob. Suppose that Alice performs the standard FQSW 




(6.1) 
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Figure 6-1: Transfer of quantum correlations between three parties: (a) The 
original AR and AB entanglement, (b) The effect of Alice sending the system 
A 1 to Bob. She is completely decoupled from the R system, (c) Alternatively, 
Alice can send the same A\ system to Ron and completely decouple from Bob! 

protocol in order to accomplish the entanglement transfer! She applies a 
random unitary to the system A n and then sets aside a subsystem A\ of 
dimension d,A 1 where 

\ogd Al > \I(A- R)^ = n [qubits] (6.2) 

as required by equation (3.25) for the FQSW protocol. Sending the system A\ 
to Bob will successfully decouple Alice from Ron and lead to the entanglement 
configuration illustrated in Figure 6-1 b). 

Note, however, that the encoding operation was not specifically targeting 
Bob. Indeed, if the same A\ system is sent to Ron instead, we would transfer 
the Bob entanglement to him and obtain the configuration of Figure 6-1 c). 
The polygamy of purification, therefore, is the observation that it is possible 
for a single quantum system A\ to contain the purification of more than one 
other system! 



1 Since in our setup the -R-entangled part of her system is clearly identifiable, 
another approach for Alice could be to simply take the n Ron-entangled qubits 
and send them to Bob. 
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6.2 Random internal dynamics for black holes 

Recently, the results of the FQSW protocol were connected to the black 
hole information paradox [79]. The question studied is not about the evolution 
of the universe as whole but something more specific. If we drop half of pure 
state |y?) ABl into a black hole, denoted B 2 , how long will it take for the its 
purification to come out? 

Under the assumption that the internal dynamics of the black hole corre- 
spond to a random unitary operation, a situation which was considered pre- 
viously in [76], we can give an answer to this question since it corresponds to 
an FQSW-type of problem except for the Schumacher compression step. We 
model the internal black hole dynamics as a random unitary U B which takes 
the system B = B\B 2 to an isomorphic system B'R, where R is released as 
radiation and B' is what remains of the black hole. The rest of the universe 
is denoted U and no assumptions are made about its size. The situation is 
illustrated in Figure 6-2. 




Figure 6-2: a) Black hole before the radiative process has taken place. The 
purification of the A system, B±, is somewhere inside the black hole. The 
system U denotes the rest of the universe, i.e. everything that is not A or B. 
b) After the black hole emits the radiation chunk R the remainder of the black 
hole is labeled B'. 

Inspired by the FQSW results, we can say that if the dimension of the 
radiated system satisfies 

logd R > \I{A- B) = \I{A- B ± ) = H(A) (6.3) 
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then, with high probability, it will contain the purification of the A system. 
This is because we can think of the black hole as an active entity mixing its 
internal degrees of freedom. 

In the current setup, we do not have the luxury of working in the i.i.d. 
regime so the statements we make are nothing more than inspired hand wav- 
ing arguments. Nevertheless, our calculation leads us to speculate that the 
purification information of a specific system will come out fairly fast and inde- 
pendently of the size of the black hole. In fact, since the system we labeled A 
was arbitrary, the purification of all subsystems of the universe with the same 
dimension comes out with the radiation R\ This is not be so surprising since 
we already know about the polygamy of purification. Nevertheless, even if the 
purification of any particular system of interest comes out quickly, we still have 
to wait until all of the black hole evaporates to recover the the purification of 
the whole universe, so the original black hole paradox remains. 

It is not clear what we mean when we say that the black hole has "internal 
dynamics". To assume that something interesting happens at the horizon is 
OK perhaps, but aren't black holes supposed to trap systems forever? 

6.3 Lost subsystem problem 

Consider now a similar situation to the above but this time the black 
hole consists of two systems B 2 L, where the L system is "lost" ; nothing ever 
leaves L. Half of a pure state \f) ABl is dropped into the black hole which 
is assumed to have random unitary dynamics on the space B = B\B 2 from 
which a system R is emitted. Once more we label B' the remainder of the 
black hole as illustrated in Figure 6-3. 

We would like to know how big the R system has to be in order for the 
purification of A to come out. This time, there are two active "participants": 
B and L, so the multiparty FQSW results have to be considered. Thus, in 
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U 

A 

• *-B 


Figure 6-3: a) The lost subsystem L is part of the black hole BL. The system 
U denotes the rest of the universe, b) The black hole has released radiation 
R from the B subsystem. The remainder of the black hole is B'L. 

order for the purification of A to come out the dimension of the radiated 
systems have to satisfy 

logd R > \I(B;A) = \I{B 1 ;A) = H{A), 
\ogd Rh > 1 1 I(L;A) = 0, (6.4) 
\ogd R + \ogd RL > \I{L-B-A)=H{A) + \I{B 2 -L). 

where d Rh is the dimension of the system released by the lost system. 

At first sight, all seems to be in order since the requirement logc^ > 
is satisfied. The inequality for the sum of the rates, however, adds an extra 
requirement for d R . To see the purification of A come out we will have to wait 
until 

\ogd R > m&x{H(A), H{A) + \I(B 2 ;L)}. (6.5) 

Thus, if the are any significant correlations between the B 2 and L parts 
of the black hole the information will not not come out quickly. This result 
is very interesting because the purification of A will be slow to come out even 
though it is held in the B part of the black hole and hasn't completely fallen 
into the L system. 




CHAPTER 7 
Conclusion 

This thesis has been an expedition into the field of quantum information 
science with many twists and turns. We began by introducing the funda- 
mental principles of classical information theory and their extensions to the 
quantum realm. Armed with the basics, we were ready to approach some of 
last decade's important results in quantum information theory with the aim 
of getting readers from outside the field up to speed. 

We then attacked the multiparty distributed compression problem with 
the most powerful weapon available in our arsenal: the fully quantum Slepian- 
Wolf protocol. The construction of the multiparty distributed compression 
protocol is conceptually simple. It consists of sequential applications of the 
two-party FQSW protocol with careful accounting of the information theoretic 
quantities at each step. However, in order to achieve rigorous proofs of the 
bounds on the multiparty rate region, we had to wage a heavy battle in difficult 
but interesting terrain. 

To achieve a rigorous proof of Theorem 5.2, the inner bound on the rate 
region, we had to dig into the geometry of convex polyhedra in m-dimensional 
space. The proof we obtained uses a sufficient level of mathematical abstrac- 
tion so as to apply to other problems in information theory involving multi- 
party rate regions proved in terms of achievable points but expressed instead 
in terms of facet inequalities. Indeed, our proof is valid for all supermodular 
rate regions, that is, all rate region specified by a set of inequalities 

^Rk>C k , V/CC{l,...,m} (7.1) 

k&K 

90 



91 

for which the constants Ctc satisfy the supermodular condition C/cuc + Cicnc > 
Ck + Cc- In particular, the rate regions for the classical multiparty Slepian- 
Wolf problem [33, 34] and the multiparty state merging protocol [43] fall into 
this category because of strong subadditivity. 

Also, in order to prove Theorem 5.3, the outer bound on the rate region, 
it was necessary to formulate a definition of the multiparty information and 
from it derive a multiparty generalization of the squashed entanglement. In 
the chapter dedicated to the multiparty squashed entanglement, we showed 
that it is a continuous, convex and subadditive measure of entanglement - 
all desirable but rare properties in the multiparty case. 

Some open problems remain which could form fruitful directions for future 
investigations. The additivity of the multiparty squashed entanglement is 
an important conjecture that was recently proved in an updated version of 
[16], which now includes W. Song in the author list. As for the distributed 
compression problem, we have fully solved the problem only for separable 
states. Perhaps a different correction term exists for the outer bound? If we 
find states for which we can calculate E sq analytically or numerically we could 
use them to further probe the shape of the outer bound. Of course, the black 
hole information paradox remains an open problem since it hasn't been solved 
by our toy-model observations. 

And so, we add the new weapon of mass decoupling to the ever growing 
collection of quantum information theory protocols derived from the nearly- 
universal building block of two-party FQSW. At the time of writing of this 
thesis, this collection contains entanglement distillation, channel simulation, 
communication over quantum broadcast channels, and many others. In fact, 
even the more general state redistribution [55] result can be obtained from the 
FQSW protocol [82]. 
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